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Chapter 1 



The Campbell Baker 
Hausdorff Formula 

1.1 The problem. 

Recall the power series: 

cxpX = 1 + X + l -X 2 + ix 3 + • • • , log(l + X) = X - l -X 2 + l -X* + • • • . 

We want to study these series in a ring where convergence makes sense; for ex- 
ample in the ring ofnxn matrices. The exponential series converges everywhere, 
and the series for the logarithm converges in a small enough neighborhood of 
the origin. Of course, 

log(cxpX) - X- exp(log(l + X)) = 1 + X 

where these series converge, or as formal power series. 

In particular, if A and B are two elements which are close enough to we 
can study the convergent series 

log[(exp^)(ex P B)] 

which will yield an element C such that expC = (exp A)(exp B). The problem 
is that A and B need not commute. For example, if we retain only the linear 
and constant terms in the series we find 

log[(l + A + ■■■ )(1 + B +•••)] = log(l + A + B + ■■■) = A + B + -- - . 

On the other hand, if we go out to terms second order, the non-commutativity 
begins to enter: 

\og[{l + A+ l -A 2 + ■■■){! + B+ l -B 2 + •••)] = 
7 
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A + B + l -A 2 + AB + ^B 2 + B + ■ • • ) 2 

= A + B+ l -[A,B} + --- 

where 

[A, B] := AB - BA (1.1) 

is the commutator of A and B, also known as the Lie bracket of A and B. 

Collecting the terms of degree three we get, after some computation, 

^ (A 2 B + AB 2 + B 2 A + BA 2 - 2ABA — 2BAB]) = ^[A,[A,B]]+^[B,[B,A]]. 

This suggests that the series for log[(exp A)(exp B)] can be expressed entirely 
in terms of successive Lie brackets of A and B. This is so, and is the content of 
the Campbcll-Baker-Hausdorff formula. 

One of the important consequences of the mere existence of this formula is 
the following. Suppose that g is the Lie algebra of a Lie group G. Then the local 
structure of G near the identity, i.e. the rule for the product of two elements of 
G sufficiently closed to the identity is determined by its Lie algebra g. Indeed, 
the exponential map is locally a diffcomorphism from a neighborhood of the 
origin in g onto a neighborhood W of the identity, and if U C W is a (possibly 
smaller) neighborhood of the identity such that U ■ U C W, the the product of 
a = exp£ and b = exprj, with a e U and b e U is then completely expressed in 
terms of successive Lie brackets of £ and r\. 

We will give two proofs of this important theorem. One will be geometric - 
the explicit formula for the series for log[(exp ^4)(exp B)] will involve integration, 
and so makes sense over the real or complex numbers. We will derive the formula 
from the "Maurer-Cartan equations" which we will explain in the course of our 
discussion. Our second version will be more algebraic. It will involve such ideas 
as the universal enveloping algebra, comultiplication and the Poincare-Birkhoff- 
Witt theorem. In both proofs, many of the key ideas are at least as important 
as the theorem itself. 

1.2 The geometric version of the CBH formula. 

To state this formula we introduce some notation. Let ad A denote the operation 
of bracketing on the left by A, so 

adA(B) := [A, B] . 

Define the function ip by 

zlogz 
*{z) = — 

which is defined as a convergent power series around the point z — 1 so 

, /, \ ,log(l + u) .„ u u 2 . „ u u 2 

iKl + «) = (1 + u) -^i- 1 = (1 + «)(l __ + _ + ...) = 1 + ___ + ... . 
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In fact, we will also take this as a definition of the formal power series for ip in 
terms of u. The Campbell-Baker-Hausdorff formula says that 



into the definition of ip, apply this operator to the clement B and then integrate. 
In carrying out this computation we can ignore all terms in the expansion of t/j 
in terms of ad A and ad B where a factor of ad B occurs on the right, since 
(ad B)B = 0. For example, to obtain the expansion through terms of degree 
three in the Campbell-Baker-Hausdorff formula, we need only retain quadratic 
and lower order terms in u, and so 



where the dots indicate either higher order terms or terms with ad B occurring 
on the right. So up through degree three (1.2) gives 



log(exp A) (exp B) = A + B + \[A, B] + 1 [A, [A, B}] - 1 [B, [A, B]] + ■■ ■ 



agreeing with our preceding computation. 

2. The meaning of the exponential function on the left hand side of the 
Campbell-Baker-Hausdorff formula differs from its meaning on the right. On 
the right hand side, exponentiation takes place in the algebra of cndomorphisms 
of the ring in question. In fact, we will want to make a fundamental rcinter- 
prctation of the formula. We want to think of A, B, etc. as elements of a Lie 
algebra, g. Then the exponentiations on the right hand side of (1.2) are still 
taking place in End(g). On the other hand, if g is the Lie algebra of a Lie group 
G, then there is an exponential map: exp: g — > G, and this is what is meant by 
the exponentials on the left of (1.2). This exponential map is a diffcomorphism 
on some neighborhood of the origin in g, and its inverse, log, is defined in some 
neighborhood of the identity in G. This is the meaning we will attach to the 
logarithm occurring on the left in (1.2). 

3. The most crucial consequence of the Campbell-Baker-Hausdorff formula 
is that it shows that the local structure of the Lie group G (the multiplication 
law for elements near the identity) is completely determined by its Lie algebra. 

4. For example, we see from the right hand side of (1.2) that group multi- 
plication and group inverse are analytic if we use exponential coordinates. 




(1.2) 



Remarks. 

1 . The formula says that we are to substitute 



u — (exp ad A) (exp ted B) — 1 
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5. Consider the function r defined by 

This is a familiar function from analysis, as it enters into the Euler-Maclaurin 
formula, see below. (It is the exponential generating function of (— l) k bk where 
the bk are the Bernoulli numbers.) Then 

tp(z) = r(logz). 



6. The formula is named after three mathematicians, Campbell, Baker, and 
Hausdorff. But this is a misnomer. Substantially earlier than the works of any 
of these three, there appeared a paper by Friedrich Schur, "Neue Begruendung 
der Theorie der endlichcn Transformationsgruppen," Mathematische Annalen 
35 (1890), 161-197. Schur writes down, as convergent power series, the com- 
position law for a Lie group in terms of "canonical coordinates", i.e., in terms 
of linear coordinates on the Lie algebra. He writes down recursive relations for 
the coefficients, obtaining a version of the formulas we will give below. I am 
indebted to Prof. Schmid for this reference. 



Our strategy for the proof of (1.2) will be to prove a differential version of 

it: 

^ log ((exp A) (exp iB)) = -0((expad A)(expt ad B))B. (1.4) 

Since log(exp A(exp tB)) = A when t = 0, integrating (1.4) from to 1 will 
prove (1.2). Let us define L = T(t) = T(t, A, B) by 

L = log ((cxp A)(expt_B)) . (1.5) 

Then 

exp T = exp A exp tB 

and so 

-^-expL(i) = exp A^- exp tB 
dt dt 

= expA(exptB)B 
= (cxpT(t))B so 

(exp-L(t))^ex P L(t) - B. 
We will prove (1.4) by finding a general expression for 

exp(-C(t))j t exp(C(t)) 
where C = C(t) is a curve in the Lie algebra, g, see (1.11) below. 
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In our derivation of (1.4) from (1.11) we will make use of an important 
property of the adjoint representation which we might as well state now: For 
any g G G, define the linear transformation 

Adj:g^g:I^ gXg^ 1 . 

(In geometrical terms, this can be thought of as follows: (The differential of ) 
Left multiplication by g carries g = Tj(G) into the tangent space, T g (G) to G 
at the point g. Right multiplication by g _1 carries this tangent space back to g 
and so the combined operation is a linear map of g into itself which we call Ad 
g. Notice that Ad is a representation in the sense that 

Ad (gh) = (Ad g)(Ad h) Vg, heG. 

In particular, for any A € g, we have the one parameter family of linear trans- 
formations Ad(exptA) and 

^Ad (exptA)X = (exp t A) AX (exp -t A) + (exptA)X(-A)(exp -tA) 
= (exptA)[A,X](exp-tA) so 

4- Ad exp tA = Ad(cxp tA) o ad A. 
at 

But ad A is a linear transformation acting on g and the solution to the differ- 
ential equation 

j f M{t) = M(i)ad A, M(0) = / 

(in the space of linear transformations of g) is exp t ad A. Thus Ad(exp tA) = 
exp(tadA). Setting t — 1 gives the important formula 

Ad (exp A) = exp(ad A). (1.6) 

As an application, consider the T introduced above. We have 

cxp(adr) = Ad (exp T) 

= Ad ((exp A) (exp tB)) 
= (Ad exp A) (Ad exptB) 
= (exp ad A) (exp ad tB) 

hence 

ad T = log((expad A)(expad tB)). (1.7) 

1.3 The Maurer-Cartan equations. 

If G is a Lie group and 7 = j(t) is a curve on G with 7(0) = A e G, then 
A _1 7 is a curve which passes through the identity at t — 0. Hence A _1 7'(0) is 
a tangent vector at the identity, i.e. an clement of g, the Lie algebra of G. 
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In this way, we have defined a linear differential form 9 on G with values in 
g. In case G is a subgroup of the group of all invertible nxn matrices (say over 
the real numbers), we can write this form as 

9 = A^dA. 



We can then think of the A occurring above as a collection of n 2 real valued 
functions on G (the matrix entries considered as functions on the group) and 
dA as the matrix of differentials of these functions. The above equation giving 
9 is then just matrix multiplication. For simplicity, we will work in this case, 
although the main theorem, equation (1.8) below, works for any Lie group and 
is quite standard. 

The definitions of the groups we are considering amount to constraints on 
A, and then differentiating these constraints show that A _1 dA takes values in 
g, and gives a description of g. It is best to explain this by examples: 

• O(n): AAt = I, dAA^ + AdA* = or 

A~ 1 dA+ (A^dA^ = 0. 
o(n) consists of antisymmetric matrices. 

• Sp(n): Let 

and let Sp(n) consist of all matrices satisfying 

AJA^ = J. 

Then 

dAJa^ + AJdA^ = 

or 

(A~ 1 dA)J+ J{A~ 1 dA)^ = 0. 
The equation BJ + JB^ = defines the Lie algebra sp(n). 

• Let J be as above and define Gl(n,C) to consist of all invertible matrices 
satisfying 

AJ = J A. 

Then 

dAJ = JdA = 0. 



and so 



A- 1 dAJ = A- 1 , JdA = JA~ x dA. 
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or 



We return to general considerations: Let us take the exterior derivative of 
the defining equation 9 = A~ 1 dA. For this we need to compute d(A^ 1 ): Since 

diAA- 1 ) = 

we have 

dA- A- 1 + Ad{A- 1 ) = {) 

diA- 1 ) = -A^dA-A' 1 . 

This is the generalization to matrices of the formula in elementary calculus for 
the derivative of 1/x. Using this formula we get 

d0 = d(A~ 1 dA) = -(A^dA ■ A- 1 ) AdA= -A _1 dA A A~ 1 dA 

or the Maurer-Cartan equation 

d9 + 9h9 = Q. (1.8) 

If we use commutator instead of multiplication we would write this as 

d6+^[6,6} = 0. (1.9) 

The Maurer-Cartan equation is of central importance in geometry and physics, 
far more important than the Campbell-Baker-Hausdorff formula itself. 

Suppose we have a map g : R 2 — > G, with s, t coordinates on the plane. Pull 
9 back to the plane, so 

9 *e = 9 -if s ds + 9 -^dt 

Define 

. . _! dg 
a = a{s,t):=g — 

and 

0:= l3{ S ,t)=g- ld ^ 

so that 

g*9 = ads + 0dt. 

Then collecting the coefficient of ds A dt in the Maurer Cartan equation gives 

S-S + M-a (...0) 

This is the version of the Maurer Cartan equation we shall use in our proof 
of the Campbell Baker Hausdorff formula. Of course this version is completely 
equivalent to the general version, since a two form is determined by its restriction 
to all two dimensional surfaces. 
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1.4 Proof of CBH from Maurer-Cartan. 

Let C(t) be a curve in the Lie algebra g and let us apply (1.10) to 

g(s,t) := exp[sC(t)} 



so that 



a(s,t) = 



= exp[-sC(i)] exp[sC(t)]C(t) 
= C(t) 
idg 



P(s,t) = g ^ 



d 

= exp[-sC(i)]— exp[sC(i)] so by (1.10) 



fSs -C'(*) + [C(*),/3] = o. 
For fixed £ consider the last equation as the differential equation (in s) 

^ = -(ad C)[3 + C, /3(0) = 
as 

where C := C(t), C" := C'(t). 

If we expand /3(s,t) as a formal power series in s (for fixed t): 

P(s, t) = ais + a 2 s 2 + a 3 s 3 H 

and compare coefficients in the differential equation we obtain a x = C , and 

na n — — (ad C)a„_i 

or 

f3(s,t) = sC'(t) + ^s(-ad C(t))C'(t) + ■■■+ ^*"(-ad C^Y^C {t) + ■■■ . 
If we define 

and set s = 1 in the expression we derived above for /3(s,i) we get 

cxp(-C(t))^exp(C(t)) = <H~ad C(t))C'(t). (1.11) 

Now to the proof of the Campbell-Baker-Hausdorff formula. Suppose that 
A and B are chosen sufficiently near the origin so that 

L = T(t) = T(t,A,B) :=log((cxpA)(cxptB)) 
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is defined for all \t\ < 1. Then, as we remarked, 

exp T = exp A exp tB 
so exp ad T — (exp ad A) (exp t ad B) and hence 

ad T = log ((exp ad A) (exp t ad B)) . 

We have 

^- exp Tit) = exp A^- exp tB 
dt dt 

= cxpA(cxptB)B 
= (expT(t)B so 

(exp— T(t))^j- expL(i) = B and therefore 

0(-ad r(t))r'(t) = S by (1.11) so 
0(- log ((exp ad A)(expf ad £)))r'(f) = £. 

Now for \z - 1| < 1 

„— logz 1 

0(-logz) = 



-log 2! 

-log z 

z - 1 



zlogz 



so 



ip(z)(f>(— logz) = 1 where VK 2 ) : = ~ so 

z — 1 

T'(t) = V ((expad ,4)(expiad B)) 5. 
This proves (1.4) and integrating from to 1 proves (1.2). 

1.5 The differential of the exponential and its 
inverse. 

Once again, equation (1.11), which we derived from the Maurer-Cartan equa- 
tion, is of significant importance in its own right, perhaps more than the use we 
made of it - to prove the Campbcll-Bakcr-Hausdorff theorem. We will rewrite 
this equation in terms of more familiar geometric operations, but first some 
preliminaries: 

The exponential map exp sends the Lie algebra g into the corresponding Lie 
group, and is a differentiable map. If £ e g we can consider the differential of 
exp at the point £: 

d(exp) ? :g = Tg ? ^TGexpj 
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where we have identified g with its tangent space at £ which is possible since g 
is a vector space. In other words, d(exp)^ maps the tangent space to g at the 
point £ into the tangent space to G at the point exp(£). At £ = we have 

d(exp) = id 

and hence, by the implicit function theorem, d(exp)^ is invertible for suffi- 
ciently small £. Now the Maurer-Cartan form, evaluated at the point exp£ 
sends TG oxp £ back to g: 

^cxp £ ■ TGcxp^ * g- 

Hence 

OX p£ ° rf(exp) e : g -> g 
and is invertible for sufficiently small £. We claim that 

r(ad£) o (<9 oxp ? o d(exp c )) = id (1.12) 

where r is as defined above in (1.3). Indeed, we claim that (1.12) is an immediate 
consequence of (1.11). 

Recall the definition (1.3) of the function r as t(z) — l/4>(—z). Multiply 
both sides of (1.11) by r(adC(t)) to obtain 

r(adC(t))exp(-C(t))|-exp(C(t)) = C"(t). (1.13) 

Choose the curve C so that £ = C(0) and r\ = C"(0). Then the chain rule says 
that 

— exp(C(t))| t=0 = d(exp)$(»7). 

Thus 

^exp(-C(i))^exp(C(i))^ = 6» cxpe d(exp) c ?7, 

the result of applying the Maurer-Cartan form 9 (at the point exp(£)) to the 
image of 77 under the differential of exponential map at £ G g. Then (1.13) at 
t = translates into (1.12). QED 



1.6 The averaging method. 

In this section we will give another important application of (1.10): For fixed 
£ G g, the differential of the exponential map is a linear map from g = 7|(g) to 
lexp^G. The (differential of) left translation by exp£ carries T exp ^(G) back to 
T e G = g. Let us denote this composite by exp^ 1 rf(exp)^. So 

0cx P £ o rf(exp) 5 = dexpjT 1 d(exp) ? : g^g 
is a linear map. We claim that for any 77 G g 

exp^T 1 rf(exp)|(r7) = / Ad cxp (_ s?) ?7ds. (1.14) 
Jo 
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We will prove this by apply ing( 1.10) to 

g(s,t) = cxp(t(£ + srf)) . 

Indeed, 

0(a,t) :^g(s,t)- 1 ^=i + .sr 1 

so 

d(3 

and 

/3(o,t) = e 

The left hand side of (1.14) is a(0, 1) where 

a(a,t) := <7(M) -1 ^ 
so we may use (1.10) to get an ordinary differential equation for a(0, t). Defining 

7(t) :=a(0,t), 

(1.10) becomes 

^=«7+[7^]- (1-15) 

For any (eg, 

-^Ad cxp -^C = Adexp-tilCC] 
= [Ad oxp _ tc C,£]. 

So for constant (eg, 

Ad oxp _ t5 C 

is a solution of the homogeneous equation corresponding to (1.15). So, by 
Lagrange's method of variation of constants, we look for a solution of (1.15) of 
the form 

7(t) = Ad«p-«C(t) 

and (1.15) becomes 

C'(t) - Ad oxpt4 77 

or 



7(i) = Ad cxp _ t? / Ad cxp si rids 
Jo 

is the solution of (1.15) with 7(0) = 0. Setting s = 1 gives 

7(1) = Ad oxp _ ? / Ad cxpsi ds 
Jo 

and replacing s by 1 — s in the integral gives (1.14). 
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1.7 The Euler MacLaurin Formula. 

We pause to remind the reader of a different role that the r function plays in 
mathematics. We have seen in (1.12) that r enters into the inverse of the 
exponential map. In a sense, this formula is taking into account the non- 
commutativity of the group multiplication, so r is helping to relate the non- 
commutative to the commutative. 

But much earlier in mathematical history, r was introduced to relate the 
discrete to the continuous: Let D denote the differentiation operator in one 
variable. Then if we think of D as the one dimensional vector field d/dh it 
generates the one parameter group exp hD which consists of translation by h. 
In particular, taking h = 1 we have 

(e D f) (x) = f(x+l). 

This equation is equally valid in a purely algebraic sense, taking / to be a 
polynomial and 

e D = l + D+-D 2 + -D 3 + --- . 

This series is infinite. But if p is a polynomial of degree d, then D k p = for 
k > D so when applied to any polynomial, the above sum is really finite. Since 

D k e ah = a k e ah 

it follows that if F is any formal power series in one variable, we have 

F(D)e ah = F(a)e ah (1.16) 

in the ring of power series in two variables. Of course, under suitable convergence 
conditions this is an equality of functions of h. 

For example, the function r(z) = 2/(1 — e~ z ) converges for \z\ < 2ir since 
±2jri are the closest zeros of the denominator (other than 0) to the origin. Hence 

\dhj z l-e~ z y ' 

holds for < \z\ < 2ir. Here the infinite order differential operator on the left 
is regarded as the limit of the finite order differential operators obtained by 
truncating the power series for r at higher and higher orders. 

Let a < b be integers. Then for any non-negative values of hi and /12 we 
have 

rb+h 2 bz az 

/ e zx dx = e h2Z e- h * z 

Ja-hi z z 

for z ^ 0. So if we set 

dh\ dn 2 
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the for < \z\ < 2n we have 



r(D 1 )r(D 2 ) 



b+h 2 



^h 2 z< 



e zx dx = T(z)e h 

a-hi z 



T (-z)e- hl2 



because t(Di)/(/i 2 ) = /(/12) when applied to any function of hi since the con- 
stant term in r is one and all of the differentiations with respect to hi give 
zero. 

Setting hi = h 2 = gives 



t(Di)t 



pb+h 2 

(D 2 ) / 

J a — hi 



e zx dx 



J)Z 



+ 



,0 < \z\ < 2tt. 



h 1= h 2 =0 



On then other hand, the geometric sum gives 

b 



k=a 



,kz 



1 



D (b-a) : 



„bz 



I _ g(6-0+l)2 

1 - e z 



+ 



l-e z l-e~ z 

We have thus proved the following exact Euler-MacLaurin formula: 



rb+h 2 

-(Di)t(D 2 ) / f(x)d 

J a — hi 



h 1= h 2 =0 k=a 



(1.18) 



where the sum on the right is over integer values of k and we have proved this 
formula for functions / of the form f(x) = e zx , < \z\ < 2tt. It is also true 
when z — by passing to the limit or by direct evaluation. 

Repeatedly differentiating (1.18) (with f(x) — e zx ) with respect to z gives 
the corresponding formula with f(x) — x n e zx and hence for all functions of the 
form x 1— > p(x)e zx where p is a polynomial and \z\ < 2tt. 

There is a corresponding formula with remainder for C k functions. 



1.8 The universal enveloping algebra. 

We will now give an alternative (algebraic) version of the Campbcll-Baker- 
Hausdorff theorem. It depends on several notions which are extremely important 
in their own right, so we pause to develop them. 

A universal algebra of a Lie algebra L is a map e : L — > UL where UL is 
an associative algebra with unit such that 

1. e is a Lie algebra homomorphism, i.e. it is linear and 



e[x,y] = e(x)e(y) - e(y)e(x) 
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2. If A is any associative algebra with unit and a : L — > A is any Lie algebra 
homomorphism then there exists a unique homomorphism <f> of associative 
algebras such that 

a = <f> o e. 

It is clear that if UL exists, it is unique up to a unique isomorphism. So 
we may then talk of the universal algebra of L. We will call it the universal 
enveloping algebra and sometimes put in parenthesis, i.e. write U(L). 

In case L = g is the Lie algebra of left invariant vector fields on a group 
G, we may think of L as consisting of left invariant first order homogeneous 
differential operators on G. Then we may take UL to consist of all left invariant 
differential operators on G. In this case the construction of UL is intuitive 
and obvious. The ring of differential operators T> on any manifold is filtered by 
degree: V n consisting of those differential operators with total degree at most 
n. The quotient, T> n /I?™ -1 consists of those homogeneous differential operators 
of degree n, i.e. homogeneous polynomials in the vector fields with function 
coefficients. For the case of left invariant differential operators on a group, these 
vector fields may be taken to be left invariant, and the function coefficients to be 
constant. In other words, (U L) n / (U L)™" 1 consists of all symmetric polynomial 
expressions, homogeneous of degree n in L. This is the content of the Poincare- 
Birkhoff-Witt theorem. In the algebraic case we have to do some work to get 
all of this. We first must construct U(L). 

1.8.1 Tensor product of vector spaces. 

Let Ei, ... , E m be vector spaces and (/, F) a multilinear map / : E x x • • • x E m — > 
F. Similarly (g, G). If £ is a linear map £ : F — > G, and g = £o f then we say 
that £ is a morphism of (/, F) to (g, G). In this way we make the set of all (/, F) 
into a category. Want a universal object in this category; that is, an object with 
a unique morphism into every other object. So want a pair (i, T) where T is a 
vector space, t : Ei x • • • x E m — > T is a multilinear map, and for every (/, F) 
there is a unique linear map £f : T — > F with 

f = £ f ot 

Uniqueness. By the universal property t = £' t ot' , t' = £' t ot so t = (£' t o£ t ,)ot, 
but also t = toid. So £' t o £ v =id. Similarly the other way. Thus (t,T), if it 
exists, is unique up to a unique morphism. This is a standard argument valid 
in any category proving the uniqueness of "initial elements" . 

Existence. Let M be the free vector space on the symbols xi , . . . , x m , xi <G 
Ei. Let N be the subspace generated by all the 

{xi, • • • , Xi -\- X^, . . . , X m ) {xi, . . . ,Xi, . . . , x m ) {xi , . . . , x^, . . . , x m ) 

and all the 

{xi, • ■ • , , dXi, . . . , x m ^j a(xi, . . . , Xi, - . • , x m ) 
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for alH = 1, . . . , m, x i7 x\ e £?j, a £ k. Let T = M/iV and 

t((X!, x m )) = {x\,..., x m )/N. 

This is universal by its very construction. QED 
We introduce the notation 

T = T(E 1 x • • • x E m ) =: Ex ® • • • ® E m . 

The universality implies an isomorphism 

(E 1 ® • • • <g> E m ) ® (£ m+1 (g) • • • <g> £ m+rl ) Ei ® • • • <g> £ TO+ „. 

1.8.2 The tensor product of two algebras. 

If A and B are algebras, they are they are vector spaces, so we can form their 
tensor product as vector spaces. We define a product structure on A <g> B by 
defining 

(ai <g> 6i) • (a 2 <g> 62) := aia 2 ® hb 2 . 

It is easy to check that this extends to give an algebra structure on A ® B. In 
case A and B are associative algebras so is A ® B, and if in addition both A 
and B have unit elements, then I a <g 1 B is a unit clement for A® B. We will 
frequently drop the subscripts on the unit elements, for it is easy to see from 
the position relative to the tensor product sign the algebra to which the unit 
belongs. In other words, we will write the unit for A ® B as 1 <E> 1. We have an 
isomorphism of A into A® B given by 

a 1— ► a ® 1 

when both A and B are associative algebras with units. Similarly for B. Notice 
that 

(a ® 1) • (1 ® b) = a ® b = (1 <g> b) ■ (a ® 1). 

In particular, an clement of the form a ® 1 commutes with an element of the 
form 1 ® b. 

1.8.3 The tensor algebra of a vector space. 

Let V be a vector space. The tensor algebra of a vector space is the solution 
of the universal problem for maps a of V into an associative algebra: it consists 
of an algebra TV and a map 1 : V — > TV such that t is linear, and for any linear 
map a : V — » A where A is an associative algebra there exists a unique algebra 
homomorphism tp : TV — > A such that a — V> o 1. We set 

T n V := V <g> ■ ■ ■ <g> V rc - factors. 

We define the multiplication to be the isomorphism 

T n V ® T m V — > T" +m V" 
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obtained by "dropping the parentheses," i.e. the isomorphism given at the end 
of the last subsection. Then 

tv :=0ry 

(with T°V the ground field) is a solution to this universal problem, and hence 
the unique solution. 

1.8.4 Construction of the universal enveloping algebra. 

If we take V — L to be a Lie algebra, and let / be the two sided ideal in TL 
generated the elements [x,y] — x <£> y + y <£> x then 

UL := TL/I 

is a universal algebra for L. Indeed, any homomorphism a of I into an associa- 
tive algebra A extends to a unique algebra homomorphism ip : TL — ► A which 
must vanish on / if it is to be a Lie algebra homomorphism. 

1.8.5 Extension of a Lie algebra homomorphism to its uni- 
versal enveloping algebra. 

If h : L — ► M is a Lie algebra homomorphism, then the composition 

e M °h: L-f UM 

induces a homomorphism 

UL -» UM 

and this assignment sending Lie algebra homomorphisms into associative algebra 
homomorphisms is functorial. 

1.8.6 Universal enveloping algebra of a direct sum. 

Suppose that: L = L\ ® L 2 , with : Lj — > U(Li), and e : L — > C/(L) the 
canonical homomorphisms. Define 

f : L —* U{L t ) ® [/(La), /(xi + ar 2 ) = ei(xi) <g> 1 + 1 <g> e 2 (a;2)- 

This is a homomorphism because x\ and X2 commute. It thus extends to a 
homomorphism 

V>:t/(L)^t/(Li)®[/(L 2 ). 

Also, 

xi i— » e(xi) 

is a Lie algebra homomorphism of L\ — ► [/ (L) which thus extends to a unique 
algebra homomorphism 

01 : tf(Li) -» U(L) 
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and similarly (f> 2 : U(L 2 ) — > f (£)• We have 

0i (zi)</>2 (x 2 ) = (h{x2)4>i{xi), x\ e £i,X2 e £ 2 

since [xi,x 2 ] = 0. As the e^x,) generate ?7(Lj), the above equation holds 
with iTj replaced by arbitrary elements %n £ U(Li),i = 1,2. So we have a 
homomorphism 

4> : U{L{) ® J7(L 2 ) -» C/(L), 0(ui <8> u 2 ) := M^M^)- 

We have 

o ip(xi + x 2 ) = <p(xi <E> 1) + 0(1 <8> x 2 ) = xi + x 2 
so <f> o ip = id, on L and hence on U(L) and 

V> o 0(ori <£> 1 + 1 <g> x 2 ) = x\ <g> 1 + 1 <g> x 2 

so V ° = id ° n L\ <g> 1 + 1 (g> L 2 and hence on U(L\) ® U(L 2 ). Thus 

c/(L! e l 2 ) = £^(ii) ® ^(£2). 

1.8.7 Bialgebra structure. 

Consider the map L — > E/(L) <8> £^(£): 

Then 

(a; <g> 1 + 1 <g> x)(y <g> 1 + 1 ® y) = 
xy£g)l + x<g)y + 2/Cg)x + +l<X> xy, 
and multiplying in the reverse order and subtracting gives 

[x® 1 + 1® x,y®l + l®y] = [x,y] ® 1 + 1® [x,y]. 

Thus the map x t— » x ® 1 + 1 ® x determines an algebra homomorphism 

A : l/(L) -» U(L)®U(L). 

Define 

e:U(L)^k, e(l) = 1, e(x) = 0,xeL 
and extend as an algebra homomorphism. Then 

(£:<g>id)(x<£> 1 + 1 ® x) = 1 <g) x, x e L. 

We identify k® L with L and so can write the above equation as 

(e®id)(x® 1 + 1 ®x) = x, x E L. 
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The algebra homomorphism 

(e <g) id) o A : C/(L) -» C/(L) 
is the identity (on 1 and on) L and hence is the identity. Similarly 

(id <g) e) o A = id. 

A vector space C with a map A : C — > C<X> C, (called a comultiplication) and 
a map e : £> — > fc (called a co-unit) satisfying 

(e <g> id) o A = id 

and 

(id <g> e) o A = id 

is called a co-algebra. If C is an algebra and both A and e are algebra homo- 
morphisms, we say that C is a bi-algebra (sometimes shortened to "bigebra"). 
So we have proved that (U(L),A,e) is a bialgebra. 
Also 

[(A <g> id) o A](x) = x <g> 1 <g> 1 + 1 <g> a; <g> 1 + 1 <g> 1 <g> x = [(id (g) A) o A](x) 

for x € L and hence for all elements of Hence the comultiplication is is 

coassociative. (It is also co-commutative.) 

1.9 The Poincare-Birkhoff-Witt Theorem. 

Suppose that V is a vector space made into a Lie algebra by declaring that 
all brackets are zero. Then the ideal / in TV defining U(V) is generated by 
x®y — y®x, and the quotient TV /I is just the symmetric algebra, SV. So the 
universal enveloping algebra of the trivial Lie algebra is the symmetric algebra. 

For any Lie algebra L define U n L to be the subspace of UL generated by 
products of at most n elements of L, i.e. by all products 

e(ari) • • • e(x m ), m < n. 

For example,, 

UqL = k, the ground field 

and 

U 1 L = k®e{L). 

We have 

U Q L c U\L c • • • C U n L c U n+1 L c • • • 

and 

U m L ■ U n L C U m + n L. 



1.9. THE POINCARE-BIRKHOFF- WITT THEOREM. 



25 



We define 

gr„ UL := U n L/U n -xL 

and 

grUL :-0gr„ UL 

with the multiplication 

gr m UL x gr„ C/L — > gr TO+ „ [/ L 

induced by the multiplication on UL. 

If a G J7 n L we let a G gr n [/Z, denote its image by the projection U n L — » 
U n L/U n _iL = gr n UL. We may write a as a sum of products of at most n 
elements of L: 

a = c i^ e ( x ^)--- € ( x ^m fl .)- 

m M <n 

Then a can be written as the corresponding homogeneous sum 

In other words, as an algebra, gr UL is generated by the elements e(x), x G L. 
But all such elements commute. Indeed, for x,y G L, 

e ( x ) e (y) - e (y) e ( x ) = 

by the defining property of the universal enveloping algebra. The right hand 
side of this equation belongs to U\L. Hence 

e( x )e(y) - e (y) e ( x ) = o 

in gr 2 UL. This proves that gr UL is commutative. Hence, by the universal 
property of the symmetric algebra, there exists a unique algebra homomorphism 

w : SL — > gr UL 

extending the linear map 

L^>grUL, xi-^e(x). 

Since the e(x) generate gr UL as an algebra, we know that this map is surjective. 
The Poincare-Birkhoff-Witt theorem asserts that 

w : SL — > gr UL is an isomorphism. (1-19) 

Suppose that we choose a basis Xi, i G / of L where / is a totally ordered 
set. Since 

e(xi)e(xj) = e(xj)e(xi) 

we can rearrange any product of e(xi) so as to be in increasing order. This 
shows that the elements 

x M ■= eixii) ■ ■ ■ e(x im ), M := (i u . . . , i m ) i t < ■ ■ ■ i m 

span UL as a vector space. We claim that (1.19) is equivalent to 
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Theorem 1 Poincare-Birkhoff-Witt. The elements xm form a basis ofUL. 

Proof that (1.19) is equivalent to the statement of the theorem. For 

any expression xm as above, we denote its length by £{M) — m. The elements 
xm are the images under w of the monomial basis in S m (L). As we know that 
w is surjective, equation (1.19) is equivalent to the assertion that w is injective. 
This amounts to the non-existence of a relation of the form 

^ c M x M = c m xm 

£(M)=n t(M)<n 

with some non-zero coefficients on the left hand side. But any non-trivial rela- 
tion between the xm can be rewritten in the above form by moving the terms 
of highest length to one side. QED 

We now turn to the proof of the theorem: 

Let V be the vector space with basis zm where M runs over all ordered 
sequences i\ < %2 < • • • < i n . (Recall that we have chosen a well ordering on / 
and that the Xi , e j form a basis of L.) 

Furthermore, the empty sequence, Z0 is allowed, and we will identify the 
symbol Z0 with the number 1 £ k. If i £ I and M = (ii,...,i n ) we write 
i < M if i < i\ and then let (i, M) denote the ordered sequence . . . ,i n ). 

In particular, we adopt the convention that if M = is the empty sequence then 
i < M for all i in which case (i, M) — (i). Recall that if M = . . . , i n )) we 
set £(M) = n and call it the length of M. So, for example, £(i, M) = £(M) + 1 
if i < M. 

Lemma 1 We can make V into an L module in such a way that 

XiZM = ZiM whenever i < M. (1-20) 

Proof of lemma. We will inductively define a map 

L x V — > V, (x, v) i— > xv 

and then show that it satisfies the equation 

xyv — yxv = [x, y]v, x,y £ L, v £ V, (1-21) 

which is the condition that makes V into an L module. Our definition will be 
such that (1.20) holds. In fact, we will define xiZm inductively on £(M) and on 
i. So we start by defining 

XiZ$ — 

which is in accordance with (1.20). This defines xiZm for £{M) = 0. For 
£(M) = 1 we define 

XiZ(j) = Z( itj ) if i < j 

while if i > j we set 

XiZ U) = x jZ{l) + [ Xi , xj]z $ = z Uii) + c % z ik) 
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where 

[Xi,Xj] = J2 c ij x k 

is the expression for the Lie bracket of Xi with Xj in terms of our basis. These 
c^j are known as the structure constants of the Lie algebra, L in terms of 
the given basis. Notice that the first of these two cases is consistent with (and 
forced on us) by (1.20) while the second is forced on us by (1.21). We now have 
defined XiZu for all i and all M with £{M) < 1, and we have done so in such a 
way that (1.20) holds, and (1.21) holds where it makes sense (i.e. for £{M) = 0). 

So suppose that we have defined XjZN for all j if £{N) < £(M) and for all 
j < i if £(N) = £(M) in such a way that 

XjZN is a linear combination of z^s with £(L) < £(N) + 1 (*). 

We then define 

ZiM if i < M 

XiZ M = (1-22) 
Xj(xiZN) + [xi, Xj]zN if M = (jN) with i > j. 

This makes sense since XiZN is already defined as a linear combination of z^s 
with £(L) < £(N) + 1 = £(M) and because [ x i j x j j c cin be written as a linear 
combination of the Xk as above. Furthermore (*) holds with j and N replaced 
by M. Furthermore, (1.20) holds by construction. We must check (1.21). By 
linearity, this means that we must show that 

XiXjZN - XjXiZ N = [Xi,Xj]z N - 

If i = j both sides are zero. Also, since both sides are anti-symmetric in i and 
j, we may assume that i > j. If j < N and i > j then this equation holds by 
definition. So we need only deal with the case where j ^ N which means that 
N = (kP) with k < P and i > j > k. So we have, by definition, 

XjZ N = XjZ(kP) 
= XjXkZp 

= X k XjZp + [Xj,X k ]z P . 

Now if j < P then XjZp = Ztjp) and k < (jP). If j ^ P then XjZp = zq + w 
where still k < Q and w is a linear combination of elements of length < £(N) . 
So we know that (1.21) holds for x = Xi,y — x^ and v — zup\ (if j < P) or 
v = zq (otherwise). Also, by induction, we may assume that we have verified 
(1.21) for all N' of length < £(N). So we may apply (1.21) to x = x u y = x k 
and v = XjZp and also to x = Xi, y — [xj,Xk], v — zp. So 

XiXjZ N = x k x l x J zp + [xi,x k ]xjZp + [xj,x k ]xiZ P + [x it [xj,x k ]]z P . 

Similarly, the same result holds with i and j interchanged. Subtracting this 
interchanged version from the preceding equation the two middle terms from 
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each equation cancel and we get 

(x^Xj XjXi*)Z]\[ — X k {x^Xj XjXj^Zp -\- {^Xi 1 ^XjjX k \\ \Xj , [x^,X/;;])zp 
= Xk[Xi,Xj]z P + ([Xi, [Xj,Xk\] - [Xj, [xi,x k ])z P 

= \xi,Xj\x k z P + ([xk, [xi,Xj]} + [x^ [xj,Xk]] - [xj, [x l ,x k ])z P 

= [Xi, Xj\Zff. 

(In passing from the second line to the third we used (1.21) applied to zp (by 
induction) and from the third to the last we used the antisymmetry of the 
bracket and Jacobi's equation. )QED 

Proof of the PBW theorem. We have made V into an L and hence into 
a U(L) module. By construction, we have, inductively, 

xmz% — z M - 
But if _ 

c M x M = o 

then 

= ^2 °mZm = °mXm} z % 
contradicting the fact the the zm are independent. QED 

In particular, the map e : L U(L) is an injection, and so we may identify 
I as a subspace of U(L). 

1.10 Primitives. 

An clement x of a bialgebra is called primitive if 

A(x) = x®l + l<gix. 

So the elements of L are primitives in U (L) . 
We claim that these are the only primitives. 

First prove this for the case L is abelian so U(L) = S(L). Then we may 
think of S(L) <£> S(L) as polynomials in twice the number of variables as those 
of S(L) and 

A(f)(u,v) = f(u + v). 
The condition of being primitive says that 

f(u + v) = f(u) + f(v). 

Taking homogeneous components, the same equality holds for each homogeneous 
component. But if / is homogeneous of degree n, taking u — v gives 

2"/(u) - 2/(«) 
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so / = unless n = l. 

Taking gr, this shows that for any Lie algebra the primitives are contained 
in Ux(L). But 

A(c + x) = c(l <g> 1) + x <g> 1 + 1 <g> a; 
so the condition on primitivity requires c = 2c or c = 0. QED 

1.11 Free Lie algebras 

1.11.1 Magmas and free magmas on a set 

A set M with a map: 

M x M —> M, (x, y) i ► xy 

is called a magma. Thus a magma is a set with a binary operation with no 
axioms at all imposed. 

Let X be any set. Define X n inductively by X\ := X and 

X n = |] JV p x X q 

p+q=n 

for Ti > 2. Thus X2 consists of all expressions ab where a and b are elements of 
X. (We write a6 instead of (a, &).) An clement of X 3 is either an expression of 
the form (ab)c or an expression of the form a(bc). An element of X 4 has one 
out of five forms: a((bc)d), a(b(cd)), ((ab)(cd)), ((ab)c)d or (a(bc))d. 
Set 

OO 

M x :=Y[X n . 

n=l 

An element w G M x is called a non-associative word, and its length £(w) is the 
unique n such that t« G X„. We have a "multiplication" map Mx x Mx given 
by the inclusion 

X X q =— » 

Thus the multiplication on Mjf is concatenation of non-associative words. 

If TV is any magma, and / : X — > TV is any map, we define F : Mx — » TV by 
F = f on X 1 , by 

i^ 1 : A 2 -» /V, F(o6) = /(a)/(6) 

and inductively 

F : l p x 1 5 ^ JV, F(uu) = F(u)F(v). 

Any element of A"„ has a unique expression as uv where u € X p and for 
a unique (p, g) with p + q = n, so this inductive definition is valid. 

It is clear that F is a magna homomorphism and is uniquely determined 
by the original map /. Thus Mx is the "free magma on X" or the "universal 
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magma on X" in the sense that it is the solution to the universal problem 
associated to a map from X to any magma. 

Let A x be the vector space of finite formal linear combinations of elements 
of Mx ■ So an element of A x is a finite sum ^2 c m m with m £ Mx and c m in 
the ground field. The multiplication in Mx extends by bi-lincarity to make Ax 
into an algebra. If we are given a map X — » B where B is any algebra, we get 
a unique magna homomorphism Mx — * B extending this map (where we think 
of B as a magma) and then a unique algebra map Ax — > B extending this map 
by linearity. 

Notice that the algebra Ax is graded since every element of Mx has a length 
and the multiplication on M x is graded. Hence A x is the free algebra on X 
in the sense that it solves the universal problem associated with maps of X to 
algebras. 

1.11.2 The Free Lie Algebra L x . 

In Ax let / be the two-sided ideal generated by all elements of the form aa, a e 
Ax and (ab)c + (bc)a + (ca)b, a, 6, c e Ax- We set 

L x := A x /I 

and call Lx the free Lie algebra on X. Any map from X to a Lie algebra L 
extends to a unique algebra homomorphism from Lx to L. 

We claim that the ideal / defining Lx is graded. This means that if a — a n 
is a decomposition of an clement of / into its homogeneous components, then 
each of the a n also belong to /. To prove this, let J C / denote the set of all 
a = ^2a n with the property that all the homogeneous components a n belong 
to /. Clearly J is a two sided ideal. We must show that I C J. For this it is 
enough to prove the corresponding fact for the generating elements. Clearly if 

a = ^2a p , b = b q , c = c r 

then 

(ab)c + (bc)a + (ca)b = ^ {{a p b q )c r + (b q c r )a p + (c r a p )b q ) . 

p,q,r 

But also if x = ^2 x m then 

x ^ ^ x n -\- ^ ^ {x m x n -\- x n x rn ^) 

m<n 

and 

so / C J. 

The fact that / is graded means that Lx inherits the structure of a graded 
algebra. 
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1.11.3 The free associative algebra Ass(X). 

Let Vx be the vector space of all finite formal linear combinations of elements 
of X. Define 

Ass x =T(V x ), 

the tensor algebra of Vx ■ Any map of X into an associative algebra A extends to 
a unique linear map from V x to A and hence to a unique algebra homomorphism 
from Ass x to A. So Assx is the free associative algebra on X. 

We have the maps X — > L x and e : — > U{L X ) and hence their com- 
position maps X to the associative algebra U(L X ) and so extends to a unique 
homomorphism 

* : Ass x — > U(L X ). 

On the other hand, the commutator bracket gives a Lie algebra structure to 
Assx and the map X — > Assx thus give rise to a Lie algebra homomorphism 

L x -» Ass x 

which determines an associative algebra homomorphism 

$ : U(L X ) — » Assx- 

both compositions $ o \I> and ^ o $ are the identity on X and hence, by unique- 
ness, the identity everywhere. We obtain the important result that U(Lx) and 
Assjc are canonically isomorphic: 

U(L X ) =Ass X - (1.23) 

Now the Poincare-Birkhoff -Witt theorem guarantees that the map e : Lx — > 
U(Lx) is injective. So under the above isomorphism, the map Ljf — > Assx is 
injectivc. On the other hand, by construction, the map X — ► Vx induces a 
surjective Lie algebra homomorphism from Lx into the Lie subalgebra of Assx 
generated by X. So we see that the under the isomorphism (1.23) Lx C U(Lx) 
is mapped isomorphically onto the Lie subalgebra of Assx generated by X. 

Now the map 

X — ► Assx <8> Assx , xh^x(8)1 + 1(8)x 
extends to a unique algebra homomorphism 

A : Assx — > Assx (g) Assx . 
Under the identification (1.23) this is none other than the map 

A:U(L x )^U(L x )®U(Lx) 
and hence we conclude that Lx is the set of primitive elements of Assx ■ 

L x = {w e Assx|A(w) = w ® 1 + 1 <E> tu.} (1-24) 
under the identification (1.23). 
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1.12 Algebraic proof of CBH and explicit for- 
mulas. 

We recall our constructs of the past few sections: X denotes a set, Lx the free 
Lie algebra on X and Assx the free associative algebra on X so that Assx 
may be identified with the universal enveloping algebra of Lx ■ Since Assx may 
be identified with the non-commutative polynomials indexed by X, we may 
consider its completion, Fx, the algebra of formal power scries indexed by X. 
Since the free Lie algebra Lx is graded we may also consider its completion 
which we shall denote by Lx- Finally let to denote the ideal in Fx generated 
by X. The maps 

exp : m — » 1 + to, log : 1 + to — > to 

are well defined by their formal power series and are mutual inverses. (There is 
no convergence issue since everything is within the realm of formal power series.) 
Furthermore exp is a bijection of the set of a E m satisfying Act = a ® 1 + 1 <g> a 
to the set of all (3 £ 1 + to satisfying A/3 = (3 ® /3. 

1.12.1 Abstract version of CBH and its algebraic proof. 

In particular, since the set {[3 <G 1 + m\Af3 = /3<g> 0} forms a group, we conclude 
that for any A, B G h x there exists a C e Lx such that 

exp C = (exp A) (exp B) . 

This is the abstract version of the Campbell-Baker-Hausdorff formula. It de- 
pends basically on two algebraic facts: That the universal enveloping algebra of 
the free Lie algebra is the free associative algebra, and that the set of primitive 
elements in the universal enveloping algebra (those satisfying Act = a® 1 + 1® a) 
is precisely the original Lie algebra. 

1.12.2 Explicit formula for CBH. 

Define the map 

<f> : to n Assx — > Lx , 
$(xi ...x n ) := [xi, [x 2 , ■ • ■ , [x n -i,x n ] ■■■] = ad(xi) • • • ad(ar n _i)(ar n ), 

and let : Assx — > End(Lx) be the algebra homomorphism extending the Lie 
algebra homomorphism ad : Lx — > End(Lx)- We claim that 

= 9(u)$(u), V u e Assx, v e to n Ass x . (1-25) 

Proof. It is enough to prove this formula when u is a monomial, 

We do this by induction on n. For n = the assertion is obvious and for n = 1 
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it follows from the definition of Suppose n > 1. Then 
$(x! ■ ■ -x n v) = Q(x 1 )$(x 2 ■ ■ -x n v) 

= e(x 1 )e(x 2 ...x n )<s>(v) 

= <d(x! ■ ■ ■ x n )$(v). QED 

Let L x denote the n— th graded component of Lx- So L x consists of linear 
combinations of elements of X, L\ is spanned by all brackets of pairs of elements 
of X, and in generalL^- is spanned by elements of the form 

[u, v] , n6 L p x , v e L\ , p + q = n. 

We claim that 

= nii V u e L x . (1.26) 

For n = 1 this is immediate from the definition of So by induction it is 
enough to verify this on elements of the form [u,v] as above. We have 

3>([u, v\) — (fr(uv — vu) 

= e(u)$(u) - e(«)$(u) 

= qO(u)v — pQ(v)u by induction 
= q[u,v]-p[v,u] 

since Q(w) — &d(w) for w G 
= {p + q)[u,v] QED. 



We can now write down an explicit formula for the n— th term in the 
Campbell-Baker-Hausdorff expansion. Consider the case where X consists of 
two elements X — {x, y}, x ^ y. Let us write 



z = log((expa;)(expy)) zeL x , z = ^ j z n {x,y). 



We want an explicit expression for z n (x,y). We know that 



Zn = -$0n) 

n 
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and z n is a sum of non-commutative monomials of degree n in x and y. Now 

/ 00 X P \ / °° y<? \ 

,c xpI „cx P! ,) = gjfjgyj 

P+<?>1 ^ y 
2 = log((expx)(expy)) 

/ x P y q 



E 



. ^— ' p!o! 

m=l \p+<?>1 



We want to apply to the terms in this last expression which are of total 
degree n so as to obtain z n . So let us examine what happens when we apply <f> 
to an expression occurring in the numerator: If q m > 2 we get since we will 
have &d(y)(y) — 0. Similarly we will get if q m = 0,p m > 2. Hence the only 
terms which survive are those with q m = 1 or q m — 0,p m — 1. Accordingly we 
decompose z n into these two types: 

1 x 



n 



p+q=n 



where 



z p,q 



and 



p,q 



^ (-l) m+1 &d(x) Pl <id(y)' 11 ■■■&<!( x) p ™y 

> ^ ; ^ ^ — - summed over all 

^ m pilgi! • • • p m \ 

Pi H 1- Pm = P, 91 H h 9m-l =5-1, % + P» > 1, Pra > 1 



v-^ (-1 m+1 adfa; ) Pl a,d(y) qi •• • ad(y 

> — V^i — summed over 

^ m Piki' ■ ■ ■ Qm-i' 

Pi H l-p m -i=p-l, qi H h 9m-l = 9, 

Pi + 9i > 1 (i = 1,. .. ,m- 1) <? m -i > 1. 



The first four terms are: 



zi{x,y) = x + y 

Z2(x,y) = -[x,y] 

zz{x lV ) = ^[x, [x,y]] + ^[y, [y,x]] 

z4x,y) = ^[x,[y,[x,y]]]. 



Chapter 2 



5/(2) and its 
Representations. 



In this chapter (and in most of the succeeding chapters) all Lie algebras and 
vector spaces are over the complex numbers. 

2.1 Low dimensional Lie algebras. 

Any one dimensional Lie algebra must be commutative, since [A", X] = in any 
Lie algebra. 

If g is a two dimensional Lie algebra, say with basis X } Y then [aX+bY, cX + 
dY] = (ad — be) [X, Y], so that there are two possibilities: [X, Y] = in which 
case g is commutative, or [X, Y] ^ 0, call it B, and the Lie bracket of any 
two elements of g is a multiple of B. So if C is not a multiple of B, we have 
[C, B] = cB for some 0, and setting A = c~ x C we get a basis A, B of g with 
the bracket relations 

[A,B] = B. 

This is an interesting Lie algebra; it is the Lie algebra of the group of all affinc 
transformations of the line, i.e. all transformations of the form 

x t-^ ax + b, a^O. 

For this reason it is sometimes called the "ax + b group" . Since 

a b\ / x\ _ fax + b 

o iJW V 1 

we can realize the group of affinc transformations of the line as a group of two 
by two matrices. Writing 

a = cxptA, b = tB 
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(o ?) =cxp< (o o)' (o ?) =cxp *(o o) 

we see that our algebra g with basis A, B and [A, B] = B is indeed the Lie 
algebra of the ax + b group. 

In a similar way, we could list all possible three dimensional Lie algebras, by 
first classifying them according to dim[g, g] and then analyzing the possibilities 
for each value of this dimension. Rather than going through all the details, we 
list the most important examples of each type. If dim[g,g] = the algebra is 
commutative so there is only one possibility. 

A very important example arises when dim[g,g] = 1 and that is the Heisen- 
berg algebra, with basis P,Q,Z and bracket relations 

[P,Q] = Z, [Z,P] = [Z,Q] = 0. 

Up to constants (such as Planck's constant and i) these are the famous Heisen- 
berg commutation relations. Indeed, we can realize this algebra as an algebra 
of operators on functions of one variable x: Let P = D = differentiation, let Q 
consist of multiplication by x. Since, for any function / = f{x) we have 

D(xf) = f + xf 

we see that [P, Q] = id, so setting Z = id, we obtain the Heisenberg algebra. 

As an example with dim[g,g] ~ 2 we have (the complexification of) the Lie 
algebra of the group of Euclidean motions in the plane. Here we can find a basis 
h, x, y of g with brackets given by 

[h,x]=y, [h,y] = -x, [x,y]=0. 

More generally we could start with a commutative two dimensional algebra and 
adjoin an element h with ad h acting as an arbitrary linear transformation, A 
of our two dimensional space. 

The item of study of this chapter is the algebra sl(2) of all two by two 
matrices of trace zero, where [g,g] = g. 

2.2 si (2) and its irreducible representations. 

Indeed sl(2) is spanned by the matrices: 

* = (o -°i)' e= (!! o)' /= (° o)- 

They satisfy 

[h,e] = 2e, [/*,/] = -2/, [e,f] = h. 

Thus every element of sl(2) can be expressed as a sum of brackets of elements 
of sl(2), in other words 

[sl{2),sl{2)} = sl{2). 
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The bracket relations above are also satisfied by the matrices 



P2(h) 



the matrices 




P2(e) 



p 3 (h) r- 



/3 

1 



\0 






-1 








-v 



Ps(e) := 







2 



















I 








■) 


, P2(f) ■= 


e 





•) 








v» 













2 










f° 


3 











(o 








o\ 










2 





, p 3 (f) 




1 






















1 







2 










\o 



















3 


0/ 



and, more generally, the (n + 1) x (n + 1) matrices given by 



Pn(h) := 



(n 










n - 2 










-n + 2 



Pn{e) 



-n) 



/0 n 





\0 



n-1 







1 

0/ 



Pn(/) ^ 



/0 

1 





0/ 



\0 

These representations of s/(2) are all irreducible, as is seen by successively 
applying p n (e) to any non-zero vector until a vector with non-zero element in 
the first position and all other entries zero is obtained. Then keep applying 
p n (f) to fill up the entire space. 

These are all the finite dimensional irreducible representations of sl(2) as 
can be seen as follows: In U(sl(2)) we have 



[hj k ] 
[ej k ] 



-2kf\ [h,e k ] = 2ke k 
-fc(fc- l)/*- 1 + kf k - 1 h. 



(2.1) 
(2.2) 



Equation (2.1) follows from the fact that bracketing by any element is a deriva- 
tion and the fundamental relations in sl(2). Equation (2.2) is proved by induc- 
tion: For k = 1 it is true from the defining relations of sl(2). Assuming it for 
k, we have 



[e,f 



: ] = [e,f]f k + f[e,f k ] 

= hf k -k(k-l)f k + kf k h 

= [h,f k } + f k h-k(k-l)f k + kf k h 

= -2kf k -k(k-l)f k + (k + l)f k h 

= -{k + l)kf k + {k+l)f k h. 
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We may rewrite (2.2) as 
1 



e — f k 
' k\ 



In any finite dimensional module V, the element h has at least one eigenvector. 
This follows from the fundamental theorem of algebra which assert that any 
polynomial has at least one root; in particular the characteristic polynomial of 
any linear transformation on a finite dimensional space has a root. So there is 
a vector w such that hw = /iw for some complex number /i. Then 

h(ew) — [h, e]w + ehw — 2ew + \iew = {[i + 2){ew). 

Thus ew is again an eigenvector of h, this time with eigenvalue /z+2. Successively 
applying e yields a vector v x such that 

hv x = Aw A , ev x = 0. (2.4) 

Then U(sl(2))v\ is an invariant subspace, hence all of V. We say that v is a 
cyclic vector for the action of g on V if U(g)v = V, 

We are thus led to study all modules for sl(2) with a cyclic vector v\ satis- 
fying (2.4). In any such space the elements 

span, and are eigenspaces of h of weight A — 2k. For any A e C we can construct 
such a module as follows: Let b + denote the subalgebra of sl(2) generated by 
h and e. Then J7(b+), the universal enveloping algebra of b + can be regarded 
as a subalgebra of U(sl(2)). We can make C into a b + module, and hence a 
U(h + ) module by 

h-l:=X, e-l:=0. 



Then the space 



U{sl{2))® u(b+) C 



with e acting on C as and h acting via multiplication by A is a cyclic module 
with cyclic vector v\ = 1 ® 1 which satisfies (2.4). It is a "universal" such 
module in the sense that any other cyclic module with cyclic vector satisfying 
(2.4) is a homomorphic image of the one we just constructed. 

This space U(sl(2)) ®j/(b+) C is infinite dimensional. It is irreducible unless 
there is some j^ f k v\ with 

where k is an integer > 1 . Indeed, any non-zero vector w in the space is a finite 
linear combination of the basis elements ^f h v\; choose k to be the largest 
integer so that the coefficient of the corrresponding element does not vanish. 
Then successive application of the clement e (/c-times) will yield a multiple of 
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v\, and if this multiple is non-zero, then U(sl(2))w 
space. 
But 



U(sl(2))v\ is the whole 



v x = (1 - k + A) 



1 



This vanishes only if A is an integer and k = A + 1, in which case there is a 
unique finite dimensional quotient of dimension k + 1 . QED 

The finite dimensional irreducible representations having zero as a weight 
are all odd dimensional and have only even weights. We will call them "even". 
They are called "integer spin" representations by the physicists. The others are 
"odd" or "half spin" representations. 



2.3 The Casimir element. 

In U(sl(2)) consider the element 

C := 1 -h 2 + ef + fe (2.5) 

called the Casimir element or simply the "Casimir" of sl(2). 
Since ef = fe + [e, f] = fe + h in U(sl(2)) we also can write 

C = l -h 2 + h + 2fe. (2.6) 

This implies that if v is a "highest weight vector" in a sl(2) module satisfying 
ev = 0, hv = Xv then 

Cv=\\{\ + 2)v. (2.7) 



Now in U(sl(2)) we have 



and 



[h,C] = 2([h,f}e + f[h,e\) 
= 2(-2/e + 2/e) 
= 



[C, e] = ^ • 2(eh + he) + 2e - 2he 



eh — he + 2e 
-[h,e] + 2e 
0. 



Similarly 



[<?,/] =0. 
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In other words, C lies in the center of the universal enveloping algebra of sl(2), 
i.e. it commutes with all elements. If V is a module which possesses a "highest 
weight vector" v\ as above, and if V has the property that v\ is a cyclic vector, 
meaning that V = U(L)v\ then C takes on the constant value 

2 

since C is central and v\ is cyclic. 

2.4 sl{2) is simple. 

An ideal J in a Lie algebra g is a subspace of g which is invariant under the 
adjoint representation. In other words, / is an ideal if [g, /] C I. If a Lie 
algebra g has the property that its only ideals are and g itself, and if g is not 
commutative, we say that g is simple. Let us prove that sl(2) is simple. Since 
sl(2) is not commutative, we must prove that the only ideals are and sl(2) 
itself. We do this by introducing some notation which will allow us to generalize 
the proof in the next chapter. Let 

g = sl(2) 

and set 

g-i := C/, g := Ch, gi := Ce 

so that g, as a vector space, is the direct sum of the three one dimensional 
spaces 

g = g-i ©go ©gi- 
Correspondingly, write any x € g as 

X = X_i + Xq + X\. 

If we let 




then we have 

x — X-i + x + x\, 
[d,x] = — ie-i + O + xi, and 
[d, [d, x]] = z_i + + x\. 

Since the matrix 

(v : i) 

is invertible, we see that we can solve for the "components" x^i,x and x\ in 
terms of x, [d,x], [d, [d,x]]. This means that if / is an ideal, then 



/ = h ® Io © h 
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where 

J_i :=/n g _i, i :=/n g0 , h ■= Jngi. 

Now if 7 7^ then d = £ I, and hence e = [d, e] and / = — [d, /] also belong 
to I so J = sZ(2). If /-i 7^ so that / e I, then ft = [e, /] e 7 so 7 = sZ(2). 
Similarly, if h ^ so that eel then ft = [e, /] e 7 so 7 = sZ(2). 

Thus if I ^ then 7 = sl{2) and we have proved that sZ(2) is simple. 

2.5 Complete reducibility. 

We will use the Casimir element C to prove that every finite dimensional rep- 
resentation W of sZ(2) is completely reducible, which means that if W is an 
invariant subspace there exists a complementary invariant subspace W" so that 
W = W W". Indeed we will prove: 

Theorem 2 L Every finite dimensional representation of sZ(2) is completely 
reducible. 

2. Each irreducible subspace is a cyclic highest weight module with highest 
weight n where n is a non-negative integer. 

3. When the representation is decomposed into a direct sum of irreducible 
components, the number of components with even highest weight is the 
multiplicity of as an an eigenvector of ft and 

4- the number of components with odd highest weight is the multiplicity of 1 
as an eigenvalue of ft. 

Proof. We know that every irreducible finite dimensional representation is a 
cyclic module with integer highest weight, that those with even highest weight 
contain as an eigenvalue of ft with multiplicity one and do not contain 1 as 
an eigenvalue of ft, and that those with odd highest weight contain 1 as an 
eigenvalue of ft with multiplicity one, and do not contain as an eigenvalue. So 
2), 3) and 4) follow from 1). We must prove 1). 
We first prove 

Proposition 1 Let O^V^W—fk^Obean exact sequence of sZ(2) 
modules and such that the action of si (2) on k is trivial (as it must be, since 
sZ(2) has no non-trivial one dimensional modules). Then this sequence splits, 
i.e. there is a line in W supplementary to V on which sZ(2) acts trivially. 

This proposition is, of course, a special case of the theorem we want to prove. 
But we shall see that it is sufficient to prove the theorem. 

Proof of proposition. It is enough to prove the proposition for the case 
that V is an irreducible module. Indeed, if V\ is a submodule, then by induction 
on dim V we may assume the theorem is known for — ► V/V\ — * W/V\ — > k — > 
so that there is a one dimensional invariant subspace M in W/V\ supplementary 



42 



CHAPTER 2. SL(2) AND ITS REPRESENTATIONS. 



to V/Vi on which the action is trivial. Let N be the inverse image of M in W. 
By another application of the proposition, this time to the sequence 

0— >Vi— »JV— >M— »0 

we find an invariant line, P, in TV complementary to V\. So N = V\® P. Since 
(W/Vi) = (V/Vi) M we must have PnV= {0}. But since dim = dim 
V + 1, we must have = V P. In other words P is a one dimensional 
subspace of W which is complementary to V. 

Next we are reduced to proving the proposition for the case that si (2) acts 
faithfully on V. Indeed, let / = the kernel of the action on V. Since sl(2) is 
simple, cither I = si (2) or I = 0. Suppose that / = sl(2). For all x £ sl(2) we 
have, by hypothesis, xW C V, and for x £ I = si (2) we have xV = 0. Hence 

[sl(2),sl(2)] = sl(2) 

acts trivially on all of W and the proposition is obvious. So we are reduced 
to the case that V is irreducible and the action, p, of sl(2) on V is injective. 
We have our Casimir element C whose image in End W must map W — > V 
since every element of sl(2) does. On the other hand, C = \n(n + 2) Id ^ 
since we are assuming that the action of sl(2) on the irreducible module V is 
not trivial. In particular, the restriction of C to V is an isomorphism. Hence 
ker C p : W — > V is an invariant line supplementary to V. We have proved the 
proposition. 

Proof of theorem from proposition. Let — > E' — > be an exact 
sequence of sZ(2) modules, and we may assume that 7^ 0. We want to find an 
invariant complement to E' in E. Define W to be the subspace of Homk(E, E') 
whose restriction to E' is a scalar times the identity, and let V C W be the 
subspace consisting of those linear transformations whose restrictions to E' is 
zero. Each of these is a submodule of End(S). We get a sequence 

and hence a complementary line of invariant elements in W. In particular, we 
can find an element, T which is invariant, maps E — > E' , and whose restriction 
to E' is non-zero. Then ker T is an invariant complementary subspace. QED 



2.6 The Weyl group. 

We have 

expe= |^ and exp— / = 

so 



(exp e) (exp -/) (exp e) = 




Since 

exp ad x = Ad(exp x) 
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we see that 

t := (exp ad e) (exp ad (— /))(expade) 
consists of conjugation by the matrix 



Thus 




and similarly r(/) = — e. In short 

t : e i — > — /, / i ► — e, h i— > —h. 

In particular, r induces the "reflection" ft, i— » —ft on Cft and hence the reflection 
/i i > — /x (which we shall also denote by s) on the (one dimensional) dual space. 
In any finite dimensional module V of sl(2) the action of the element r = 
(exp e) (exp— /)(expe) is defined, and 

(t)- 1 /^) = Ad (t- 1 ) (ft) = s^h = sh 

so if 

hu = fiu 

then 

h(ru) — T(r)~ 1 h(T)u — rs(h)u — —firu — (s/i)tu. 

So if 

V M : {u e y|ftu = /Um} 

then 

r(ty - ^. (2.8) 

The two element group consisting of the identity and the element s (acting 
as a reflection as above) is called the Weyl group of s/(2). Its generalization 
to an arbitrary simple Lie algebra, together with the generalization of formula 
(2.8) will play a key role in what follows. 
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Chapter 3 



The classical simple 
algebras. 



In this chapter we introduce the "classical" finite dimensional simple Lie al- 
gebras, which come in four families: the algebras sl(n + 1) consisting of all 
traceless (n + 1) x (n + 1) matrices, the orthogonal algebras, on even and odd 
dimensional spaces (the structure for the even and odd cases are different) and 
the symplectic algebras (whose definition we will give below). We will prove 
that they are indeed simple by a uniform method - the method that we used 
in the preceding chapter to prove that sl(2) is simple. So we axiomatize this 
method. 



3.1 Graded simplicity. 

We introduce the following conditions on the Lie algebra g: 



g 




oo 
i=-l 


(3.1) 


[Si,Sj] 


C 


Si+j 


(3.2) 


[gl,g-l] 




go 


(3.3) 


[g-i.s] =0 


=> 


x = 0, V x e su Vi > 


(3.4) 


There exists a d € go 


satisfying 


[d, x] = kx, x e gfe, Vfc, 


(3.5) 



and 

g_i is irreducible under the (adjoint) action of go- (3-6) 

Condition (3.4) means that if x e g i; i > is such that [y, x) = for all y e g_i 
then x = 0. 

We wish to show that any non-zero g satisfying these six conditions is simple. 
We know that g-i,go and gi are all non-zero, since ^ d £ go by (3.5) and 
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[g-i,gi] = go by (3.3). So g can not be the one dimensional commutative 
algebra, and hence what we must show is that any non-zero ideal / of g must 
be all of g. 

We first show that any ideal I must be a graded ideal, i.e. that 

/ = I-x © Iq © h © • • • j where Ij := I n gj. 

Indeed, write any x <G g as x = X-\ + x + x\ + ■ ■ ■ + Xk and successively bracket 
by d to obtain 

x = x_i+x + xiH \- x k 

[d,x] = + + .ti H h kxk 

[d,[d,x]] = x_i+0 + tiH h k 2 x k 



(&dd) k x 
{a.dd) k+1 x 



(-1)' £ .t_i + + xi H h k k x k 

(-l) k+1 x^ + + x r + ■ ■ ■ + k k+: 



x k . 



The matrix 



/ 1 



{-l) k 
V( _ 1)fe+ i 



1 1 
1 



1 \ 

k 

k k 
k k+1 J 



is non singular. Indeed, it is a van der Monde matrix, that is a matrix of the 
form 



/ 1 



Vl +1 



l 

*2 



1 \ 

tk+2 



L k+2 
j-k+1 
L k+2, 



whose determinant is 



U.(U-tj) 



i<j 



and hence non-zero if all the tj are distinct. Since t\ = — 1, t<i = 0, t$ = 1 etc. in 
our case, our matrix is invertible, and so we can solve for each of the components 
of x in terms of the (add) J x. In particular, if x E I then all the (add)- 7 a; € I 
since / is an ideal, and hence all the component Xj of x belong to / as claimed. 

The subspace I-i c g-i is invariant under the adjoint action of go on 
g_i, and since we are assuming that this action is irreducible, there are two 
possibilities: i_i = or I_i = g_!. We will show that in the first case 1 = 
and in the second case that / = g. 

Indeed, if 7_i = we will show inductively that Ij = for all j > 0. Suppose 
7^ y G go- Since every element of [J_i, y] belongs to I and to g_i we conclude 
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that [g_i,y] = and hence that y — by (3.4). Thus Iq = 0. Suppose that we 
know that Ij-i = 0. Then the same argument shows that any y € Ij satisfies 
[g-i, y] = and hence y = 0. So Ij = for all j, and since / is the sum of all 
the Ij we conclude that 1 = 0. 

Now suppose that 7_i = g_ x . Then g = [g-i,gi] = [i-i,gi] C /. Fur- 
thermore, since d e g C / we conclude that gk C I for all k ^ since every 
clement y of such a g& can be written as y = £ [d, y] G /. Hence I = g. QED 

For example, the Lie algebra of all polynomial vector fields, where 

d 

gk = X 1 -^ — X 1 homogenous polynomials of degree k + 1} 

is a simple Lie algebra. Here d is the Euler vector field 

d d 

d = x 1 - 1 hx„- — . 

OX i ox n 

This algebra is infinite dimensional. We are primarily interested in the finite 
dimensional Lie algebras. 



3.2 s/(n + l) 

Write the most general matrix in sl(n + 1) as 

/ -trA w* \ 
{ v A ) 

where A is an arbitrary nxn matrix, v is a column vector and w* = (wi , . . . , w n ) 
is a row vector. Let g_i consist of matrices with just the top row, i.e. with 
v = A = 0. Let gi consist of matrices with just the left column, i.e. with 
A = w* = 0. Let go consist of matrices with just the central block, i.e. with 
v = w* = 0. Let 

where I is the nxn identity matrix. Thus go acts on p_i as the algebra of all 
endomorphisms, and so g_i is irreducible. We have 

JO \ ( w* V / -(w*,v) \ 
\ v J ' V /' ^ v®w* J ' 

where (w*,v) denotes the value of the linear function w* on the vector v, and 
this is precisely the trace of the rank one linear transformation v ® w* . Thus 
all our axioms are satisfied. The algebra sl(n + 1) is simple. 
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3.3 The orthogonal algebras. 

The algebra o(2) is one dimensional and (hence) commutative. In our (real) 
Euclidean three dimensional space, the algebra o(3) has a basis X. Y. Z (in- 
finitesimal rotations about each of the axes) with bracket relations 

[X,Y]=Z, [Y,Z]=X, [Z,X] = Y, 

(the usual formulae for "vector product" in three dimensions" . But we are over 
the complex numbers, so can consider the basis X + iY, —X + iY, iZ and find 
that 

[iZ,X + iY] = X + iY, [iZ,-X + iY} = -(-X + iY), [X + iY, -X + iY] = HZ. 

These are the bracket relations for sl(2) with e = X + iY, f = —X + iY, h = 
iZ. In other words, the complcxification of our three dimensional world is the 
irreducible three dimensional representation of sl(2) so o(3) = sl(2) which is 
simple. 

To study the higher dimensional orthogonal algebras it is useful to make two 
remarks: 

If V is a vector space with a non-degenerate symmetric bilinear form ( , ), 
we get an isomorphism of V with its dual space V* sending every u e V to the 
linear function l u where l u (v) = (v,u). This gives an identification of 

End(V) = V®V* with V®V. 

Under this identification, the elements of o(V) become identified with the anti- 
symmetric two tensors, that is with elements of A 2 (V). (In terms of an or- 
thonormal basis, a matrix A belongs to o(V) if and only if it is anti-symmetric.) 

Explicitly, an element uAv becomes identified with the linear transformation 
A uAv where 

A uAv x = (x, v)u - (u, x)v. 

This has the following consequence. Suppose that z e V with (z, z) ^ 0, and 
let w be any element of V. Then 

A wAz z = (z, z)w - (z, w)z 

and so U(o(V))z = V. On the other hand, suppose that u e V with (u,u) = 0. 
We can find v G V with (v, v) = and (v, u) = 1. Now suppose in addition that 
dim V > 3. We can then find a z € V orthogonal to the plane spanned by u 
and v and with (z, z) = 1. Then 

A z /w u = z , 

so z <G U(o(V))e and hence U(o(V))u = V. We have proved: 

1 // dim V > 3, then every non-zero vector in V is cyclic, i.e the representa- 
tion of o(V) on V is irreducible. 
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(In two dimensions this is false - the line spanned by a vector e with (e, e) = 
is a one dimensional invariant subspace.) 
We now show that 

2 o(V) is simple for dim V > 5. 

For this, begin by writing down the bracket relations for elements of o(V) in 
terms of their parametrization by elements of h 2 V . Direct computation shows 
that 

[A uAv , A xAy ] = (v, x)A uAy - (u, x)A vAy - (v , y)A uAx + (u, y)A vAx . (3.7) 
Now let n = dim V — 2 and choose a basis 

ti, v , X\ , . . . , x n 

of V where 

(u, u) = (u, Xi) = (v, v) = (v, Xi) = Vi, (u,v) = l, (xi,Xj) = 8ij. 
Let g := o(V) and write W for the subspace spanned by the Xi. Set 

d • A uAv 

and 

g_i := {A vAx , x G W^}, g0 := o(W) ® Cd, gi := {A uAxi x e VK}. 

It then follows from (3.7) that d satisfies (3.5). The spaces g_i and gi look like 
copies of W with the o(W) part of go acting as o(W), hence irreducibly since 
dim IF > 3. All our remaining axioms are easily verified. Hence o(V) is simple 
for dim V > 5. 

We have seen that o(3) = sl(2) is simple. 

However o(4) is not simple, being isomorphic to sl(2) © sl(2): Indeed, if Z\ 
and Zi are vector spaces equipped with non-degenerate anti-symmetric bilinear 
forms ( , )i and ( , )i then Z\ ® Z<i has a non-degenerate symmetric bilinear form 
( , ) determined by 

(ui O u 2 , ui ® u 2 ) = (ui,v 1 } 1 (u 2 ,v 2 )2- 

The algebra sZ(2) acting on its basic two dimensional representation infinitesi- 
mally preserves the antisymmetric form given by 

(GO-GO)-*—*- 

Hence, if we take Z = Zi = Z 2 to be this two dimensional space, we see that 
sl(2) sl(2) acts as infinitesimal orthogonal transformations on Z ® Z which is 
four dimensional. But o(4) is six dimensional so the embedding of sl(2) sl(2) 
in o(4) is in fact an isomorphism since 3 + 3 = 6. 
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3.4 The symplectic algebras. 

We consider an even dimensional space with coordinates q\ , q 2 , . . . , pi , p 2 , 

The polynomials have a Poisson bracket 

This is clearly anti-symmetric, and direct computation will show that the Ja- 
cobi identity is satisfied. Here is a more interesting proof of Jacobi's identity: 
Notice that if / is a constant, then {/, g} = for all g. So in doing bracket 
computations we can ignore constants. On the other hand, if we take g to be 
successively gi, . . . , q n ,Pi, ■ ■ ■ ,Pn m (3-8) we see that the partial derivatives of / 
are completely determined by how it brackets with all g, in fact with all linear 
g. If we fix f , the map 

h ~ {/, h} 

is a derivation, i.e. it is linear and satisfies 

{f,h 1 h 2 } = {f,h 1 }h 2 + h 1 {f,h 2 }. 

This follows immediately from from the definition (3.8). Now Jacobi's identity 
amounts to the assertion that 

{{/, .9}, h} = {/, {g, h}} -{g, {f,h}}, 

i.e. that the derivation 

h~{{f,g},h} 
is the commutator of the of the derivations 

h i ► {/, h} and h i— > {g, h}. 

It is enough to check this on linear polynomials h, and hence on the polynomials 
qj and pk- If we take h = qj then 



so 



/ df d 2 g _ df d 2 g 
\dp l dq i dp dqidpidpj 
dg d 2 f dg d 2 f 



dpi dqidpj dqi dpidpj 



so 



d 

{f,{9,Qj}}-{g,{f,qj}} = ^ri/'ff} 

= {{f,g},Qj} 



as desired, with a similar computation for pk- 
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The symplectic algebra sp(2n) is defined to be the subalgebra consisting of 
all homogeneous quadratic polynomials. We divide these polynomials into three 
groups as follows: Let gi consist of homogeneous polynomials in the <?'s alone, 
so gi is spanned by the qiqj. Let g_i be the quadratic polynomials in the p's 
alone, and let g be the mixed terms, so spanned by the qiPj. It is easy to see 
that go ~ gl{n) and that [g-i,gi] = go- To check that g_i is irreducible under 
go, observe that [piqj , PkPi] = if j ^ k or £, and [piqj,PjPe] is a multiple of 
PiPi- So we can by one or two brackets carry any non-zero element of g_i into 
a non-zero multiply of pi , and then get any monomial from p\ by bracketing 
with piqi appropriately The element d is given by \{piq\ + ■ ■ ■ +p n Qn)- 

We have shown that the symplectic algebra is simple, but we haven't really 
explained what it is. Consider the space of V of homogenous linear polynomials, 
i.e all polynomials of the form 

£ = a 1 q 1 -\ \-a n q n + b x p q -\ h b n p n . 

Define an anti-symmetric bilinear form uj on V by setting 

w(i,C) := {£,£'). 

From the formula (3.8) it follows that the Poisson bracket of two linear functions 
is a constant, so u) does indeed define an antisymmetric bilinear form on V, 
and we know that this bilinear form is non-degenerate. Furthermore, if / is a 
homogenous quadratic polynomial, and £ is linear, then {/,£} is again linear, 
and if we denote the map 

by A = Af, then Jacobi's identity translates into 

lo(A£,£')+lu(£A£') = (3.9) 

since {£,£'} is a constant. Condition (3.9) can be interpreted as saying that A 
belongs to the Lie algebra of the group of all linear transformations R on V 
which preserve uj, i.e. which satisfy 

lj{M,R£') = lo{£,£'). 

This group is known as the symplectic group. The form u> induces an isomor- 
phism of V with V* and hence of Hom(V, V) — V <g> V* with V <8> V, and this 
time the image of the set of A satisfying (3.9) consists of all symmetric ten- 
sors of degree two, i.e. of S 2 (V). (Just as in the orthogonal case we got the 
anti-symmetric tensors). But the space S 2 (V) is the same as the space of ho- 
mogenous polynomials of degree two. In other words, the symplectic algebra as 
defined above is the same as the Lie algebra of the symplectic group. 

It is an easy theorem in linear algebra, that if V is a vector space which 
carries a non-degenerate anti-symmetric bilinear form, then V must be even 
dimensional, and if dim V = 2n then it is isomorphic to the space constructed 
above. We will not pause to prove this theorem. 
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3.5 The root structures. 

We are going to choose a basis for each of the classical simple algebras which 
generalizes the basis e, /, h that we chose for sl(2). Indeed, for each classical 
simple algebra g we will first choose a maximal commutative subalgebra h all 
of whose elements are semi-simple = diagonizable in the adjoint representation. 
Since the adjoint action of all the elements of h commute, this means that they 
can be simultaneously diagonalized. Thus we can decompose g into a direct 
sum of simultaneous eigenspaces 

g = h®0g Q (3.10) 

a 

where ^ a e h* and 

g Q := {x e g\[h, x] = a(h)x V h € h}. 

The linear functions a are called roots (originally because the a(h) are roots 
of the characteristic polynomial of ad(/i)). The simultaneous eigenspace g a is 
called the root space corresponding to a. The collection of all roots will usually 
be denoted by <&. 

Let us see how this works for each of the classical simple algebras. 



3.5.1 A n = sl(n + 1). 

We choose h to consist of the diagonal matrices in the algebra sl(n + 1) of all 
(n + 1) x (n + 1) matrices with trace zero. As a basis of h we take 

/l ••• 0\ 






-1 


• • 


• 







• • 


• <v 


/o 





••• 


o\ 







1 


••• 













-1 ••• 







\o 





••• 


v) 




/o 





••• 


o\ 








1 












... o 


-v 



Let Li denote the linear function which assigns to each diagonal matrix its 
i-th (diagonal) entry, 
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Let Eij denote the matrix with one in the i, j position and zero's elsewhere. 
Then 

[h,E ij ] = (L i (h)-L j (h))E ij V heh 
so the linear functions of the form 

Li-Lj, iy^j 

are the roots. 

We may subdivide the set of roots into two classes: the positive roots 

$+ — {Li-Lj; i<j} 

and the negative roots 

*" :=-$+ = {Lj - L h i<j}. 
Every root is cither positive or negative. If we define 

CLi := Li — L i+ i 
then every positive root can be written as a sum of the af. 

Li - Lj = Q.i + h cxj-i. 

We have 

ai(hi) = 2, 

and for i ^ j 

ai(h i±1 ) = -l, ai(hj) =0, j?i±l. (3.11) 

The elements 

Ei t i+\, hi, Ei + i t i 

form a subalgebra of sl(n + 1) isomorphic to sl(2). We may call it sZ(2)j. 
3.5.2 C n = sp{2n),n > 2. 

Let h consist of all linear combinations of piQi, . . . ,p n Qn an d let Li be defined 

by 

Li (aipiqi H h a n p n q n ) = a t 

so L\, . . . , L n is the basis of h* dual to the basis piqi, ■ ■ ■ ,p n q n of h. 
If h = Oipi^i H h a n p n q n then 

[h,q l q j ] = (a i + a j )q i q 3 
[h,q l p>] = (a i -a j )q t p> 
[h,p l p>] = -{a i + aj)p i p> 
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so the roots are 

±(Lj + Lj) all i,j and Li — Lj i ^ j. 
We can divide the roots $ into positive and negative roots by setting 
$+ = {L l + Lj}^ .. U {i, - L 3 } 1<3 . 

If we set 

ol\ := Li — L 2 , . . . , a n -i ■= L n _\ — L n , a n := 2L n 

then every positive root is a sum of the at. Indeed, + L n = a n -i + a n 

and 2L„_! = 2a„_ 1 + a n and so on. In particular 2a„_ 1 + a„ is a root. 
If we set 

hi := Piqi - P2Q2, • • • , K-l ■= Pn-lQn-l - PnQn, K := p n q n 

then 

oti(hi) = 2 

while for i =/= j 

a»(/i»±i) = -1, i = l, ...,n-l 

Oi(hj) = 0, j^i±l,i= l,...,n (3.12) 
a n (h n -i) = -2. 

In particular, the elements h i} qiPi+i , qi+iPi for i = 1, . . . , n— 1 form a subalgebra 
isomorphic to s/(2) as do the elements h n ,\q\, — \p\- We call these subalgebras 
sl(2)i, i = l, ...,n. 

3.5.3 L> n = o(2n), n > 3. 

We choose a basis Ui , . . . , u n , v\ , . . . , v n of our orthogonal vector space V such 
that 

(ui,Uj) = (vi,vj) = 0,V i,j, (ui,vj) = Sij. 

We let h be the subalgebra of o(V) spanned by the A UiVi , i = 1, . . . , n. Here we 
have written A xy instead of A xAy in order to save space. We take 

A A 

as a basis of h and let Li, . . . , L n be the dual basis. Then 

±L k ± L e k + I 
are the roots since from (3.7) we have 

[A Ui Vi 7 A UkUi ] = (5ik + 5ig)A UkUe 



3.5. THE ROOT STRUCTURES. 



55 



We can choose as positive roots the 

L k + Le,L k -L tl k < i 

and set 

OH := Li - L i+1 , i = 1, . . . ,n - 1, a n := L„_i + L r , 
Every positive root is a sum of these simple roots. If we set 

hi . A UiVi A Ui+lVi+l ,i 1, ... n 1, 



and 
then 

and for i ^ j 



h — A -4- A 

ai(hi) = 2 



cti(hj) = j/i±l, i = l,...n-2 
a»(h»±i) = -1 i = l,...,n-2 
a„_i(/i„_ 2 ) = -1 (3.13) 
a n {h n -2) = -1 
a n (h n -i) = 0. 

For i — 1, . . . , n — 1 the elements /i,, A u . Vi+1 , A Ui+lV . form a subalgebra isomor- 
phic to s/(2) as do /i„, J 4„ n _ 1 „ n ,^4„ n _ 1 „ n . 

3.5.4 £ n = o(2n + 1) ra > 2. 

We choose a basis m, . . . ,u n , v\, . . . , v n , x of our orthogonal vector space V such 
that 

(ui,Uj) = (vi,Vj) = 0,V {ui,vj) = Sij, 

and 

(x, Ui) = (x, Vi) = V i, (ar, x) = 1. 

As in the even dimensional case we let h be the subalgebra of o(V) spanned by 
the A UiV . , i = 1, . . . , n and take 

A A 

as a basis of h and let L 1; . . . , L„ be the dual basis. Then 

±Li ±Lj j, ±Li 

are roots. We take 

Li± Lj, 1 < i < j < n, together with Li, i = 1, . . . , n 
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to be the positive roots, and 



oti ■= Li - L i+ i, i = 1, . . . ,n - 1, a n := L. 



'n 



to be the simple roots. We let 



h — A — A 



■Iti+lVi+l ' 



i = 1, . . . n — 1, 



as in the even case, but set 



Then every positive root can be written as a sum of the simple roots 



ai(hi) = 2, i 



= l,...n, 



and for i ^ j 



a t (hj) 
ai(h i±l ) 
a n -i(h n ) 



j ^ i ± 1, i = 1,.. .n 
— 1 i = 1, . . . ,n — 2,n 

-2 



(3.14) 



Notice that in this case a n -i + 2a„ = + L„ is a root. Finally we can 

construct subalgebras isomorphic to sl(2), with the first n — 1 as in the even 
orthogonal case and the last sZ(2) spanned by /i„, A UnX , — A^^j;. 

3.5.5 Diagrammatic presentation. 

The information of the last four subsections can be summarized in each of the 
following four diagrams: 

The way to read this diagram is as follows: each node in the diagram stands 
for a simple root, reading from left to right, starting with ol\ at the left. (In the 
diagram Di the two rightmost nodes are a^_i and at, say the top at-\ and the 
bottom at-) Two nodes a* and aj are connected by (one or more) edges if and 
only if a l (h j ) ^ 0. 

In all cases, the difference, at — aj is never a root, and, for i ^ j, ai(hj) < 
and is an integer. If, for i ^ j, ai(hj) < then at + aj is a root. 

In two of the cases (Bt and Ct) it happens that ai(hj) = —2. Then + aj 
and Qj + 2a j are roots, and we draw a double bond with an arrow pointing 
towards aj. In this case 2 is is the maximum integer such that Qj + kaj is a 
root. In all other cases, this maximum integer k is one if the nodes are connected 
(and zero it they arc not). 

3.6 Low dimensional coincidences. 

We have already seen that o(4) <~ sl(2) © sl(2). We also have 



o(6) ~ sZ(4). 
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• — • — — «=>• B e e > 3 



• • •<=• Ci£>2 



D e l>4 



Figure 3.1: Dynkin diagrams of the classical simple algebras. 

Both algebras are fifteen dimensional and both are simple. So to realize this 
isomorphism we need only find an orthogonal representation of si (4) on a six 
dimensional space. If we let V — C 4 with the standard representation of si (4), 
we get a representation of si(4) on A 2 (V) which is six dimensional. So we must 
describe a non-degenerate bilinear form on A 2 V which is invariant under the 
action of si (4). We have a map, wedge product, of 

A 2 V x a 2 v -» A 4 V. 

Furthermore this map is symmetric, and invariant under the action of gl{4). 
However si(4) preserves a basis (a non-zero element) of A 4 V and so we may 
identify A 4 V with C. It is easy to check that the bilinear form so obtained is 
non-degenerate 

We also have the identification 

sp(4) ~ o(5) 

both algebras being ten dimensional. To see this let V — C 4 with an antisym- 
metric form co preserved by Sp(4) . Then to <g> ui induces a symmetric bilinear 
form on V ® V as we have seen. Sitting inside V (8) V as an invariant subspace 
is A 2 y as we have seen, which is six dimensional. But A 2 V is not irreducible as 
a representation of sp(4). Indeed, to e A 2 V* is invariant, and hence its kernel is 
a five dimensional subspace of A 2 V which is invariant under sp(4). We thus get 
a non-zero homomorphism sp(4) — > o(5) which must be an isomorphism since 
sp{4) is simple. 
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These coincidences can be seen in the diagrams. If we were to allow I = 2 in 
the diagram for Bg it would be indistinguishable from Ci- If we were to allow 
t = 3 in the diagram for it would be indistinguishable from A 3 . 

3.7 Extended diagrams. 

It follows from Jacobi's identity that in the decomposition (3.10), we have 

[go, go'] C g a+a > (3.15) 

with the understanding that the right hand side is zero if a + a' is not a root. 
In each of the cases examined above, every positive root is a linear combination 
of the simple roots with non-negative integer coefficients. Since the algebra is 
finite, there must be a maximal positive root (3 in the sense that [3 + on is not 
a root for any simple root. For example, in the case of A n = sl(n+ 1), the root 
(3 := L\ — L n+ \ is maximal. The corresponding g^j consists of all (n+ 1) x (n+ 1) 
matrices with zeros everywhere except in the upper right hand corner. We can 
also consider the minimal root which is the negative of the maximal root, so 

a o —p = L n+ \ — L\ 

in the case of A n . Continuing to study this case, let 

h := h n+1 — hi. 

Then we have 

ai(hi) = 2, i = 0,...n 

and 

a (/ii) = a (h n ) = -1, a (hi)=0, i^0,l,n. 

This means that if we write out the (n + 1) x (n + 1) matrix whose entries are 
ai(hj), i,j = 0,...nwe obtain a matrix of the form 

21 - M 

where My = 1 if and only if j = ±1 with the understanding that n+ 1 = and 
— 1 = n, i.e we do the subscript arithmetic mod n. In other words, M is the 
adjacency matrix of the cyclic graph with n+l vertices labeled 0,...n. Also, 
we have 

ho + hi H h h n = 0. 

If we apply a, to this equation for i = 0, ... n we obtain 

(2/-M)l = 0, 

where 1 is the column vector all of whose entries are 1. We can write this 
equation as 

Ml = 21. 
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In other words, 1 is an eigenvector of M with eigenvalue 2. 

In the chapters that follow we shall see that any finite dimensional simple Lie 
algebra has roots, simple roots, maximal roots etc. giving rise to a matrix M 
with integer entries which is irreducible (in the sense of non-negative matrices - 
definition later on) and which has an eigenvector with positive (integer) entries 
with eigenvalue 2. This will allow us to classify the simple (finite dimensional) 
Lie algebras. 



CHAPTER 3. THE CLASSICAL SIMPLE ALGEBRAS. 



Chapter 4 

Engel- L ie- C art an- Wey 1 



We return to the general theory of Lie algebras. Many of the results in this 
chapter are valid over arbitrary fields, indeed if we use the axioms to define 
a Lie algebra over a ring many of the results are valid in this generality. But 
some of the results depend heavily on the ring being an algebraically closed field 
of characteristic zero. As a compromise, throughout this chapter we deal with 
fields, and will assume that all vector spaces and all Lie algebras which appear 
are finite dimensional. We will indicate the necessary additional assumptions on 
the ground field as they occur. The treatment here follows Serre pretty closely. 

4.1 Engel's theorem 

Define a Lie algebra g to be nilpotent if: 

3n\ [xi, [x 2 , . . .x n+1 ] . . .] = V xi, . . . ,x n+1 £ g. 

Example: n + := n + (gl(d)) := all strictly upper triangular matrices. Notice 
that the product of any d + 1 such matrices is zero. 

The claim is that all nilpotent Lie algebras arc essentially like n + . 

We can reformulate the definition of nilpotent as saying that the product of 
any n operators adx^ vanishes. One version of Engel's theorem is 

Theorem 3 g is nilpotent if and only z/adx is a nilpotent operator for each 

X £ g. 

This follows (taking V = g and the adjoint representation) from 

Theorem 4 Engel Let p : g — > End(V) be a representation such that p(x) is 
nilpotent for each x £ g. Then there exists a basis in terms of which p(g) C 
n + (gl(d)), i.e. becomes strictly upper triangular. Here d =dim V. 

Given a single nilpotent operator, we can always find a non-zero vector, v 
which it sends into zero. Then on V/{v} a non-zero vector which the induced 
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map sends into zero etc. So in terms of such a flag, the corresponding matrix 
is strictly upper triangular. The theorem asserts that we are can find a single 
flag which works for all p(x). In view of the above proof for a single operator, 
Engel's theorem follows from the following simpler looking statement: 

Theorem 5 Under the hypotheses of Engel's theorem, ifV^O, there exists a 
non-zero vector v € V such that p(x)v — Vx € g. 

Proof of Theorem 5 in seven easy steps. 

• Replace g by its image, i.e. assume that g C End V. 

• Then (ad x)y = L x y — R x y where L x is the linear map of End V into itself 
given by left multiplication by x, and R x is given by right multiplication 
by x. Both L x and R x are nilpotent as operators since x is nilpotent. 
Also they commute. Hence by the binomial formula (ad x) n = (L x — R x ) n 
vanishes for sufficiently large n. 

• We may assume (by induction) that for any Lie algebra, m, of smaller 
dimension than that of g (and any representation) there exists a v € V 
such that xv = \/x e m. 

• Let k C g be a subalgebra, k^g, and let 

N = N(k) := {x G g|(ada;)k C k} 

be its normalizer. The claim is that 

3 N(k) is strictly larger than k. 

To see this, observe that each x £ k acts on k and on g/k by nilpotent 
maps, and hence there is an ^ y e g/k killed by all x e k. But then 
y £ k, and [y, x] — —[x,y] ek for all i£k. So y e N(k), y £ k. 

• If g ^ 0, there is an ideal i C g such that dimg/i = 1. Indeed, let i be a 
maximal proper subalgebra of g. Its normalizer is strictly larger, hence all 
of g, so i is an ideal. The inverse image in g of a line in g/i is a subalgebra, 
and is strictly larger than i. Hence it must be all of g. 

• Choose such an ideal, i. The subspace 

W C V, W = {v\xv = 0, \Jx e i} 

is invariant under g. Indeed, if y <G g, w G W then xyw = yxw + [x, y]w = 
0. 

• W 7^ by induction. Take y e g, y £ i. It preserves W and is nilpotent. 
Hence there is a non-zero v € W with yv — 0. Since y and i span g, we 
have xv = Vi€g. QED 

No assumptions about the ground field went into this. 
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4.2 Solvable Lie algebras. 

Let g be a Lie algebra. D n g is defined inductively by 

D°g:=g, D\g):=[g,g},..., D n+1 g:=[D n g,D n g}. 

If we take b to consist of all upper triangular n x n matrices, then D 1 b = n + 
consists of all strictly triangular matrices and then successive brackets eventually 
lead to zero. We claim that the following conditions are equivalent and any Lie 
algebra satisfying them is called solvable. 

1. 3n \D n g = 0. 

2. 3n such that for every family of 2™ elements of g the successive brackets 
of brackets vanish; e.g for n = 4 this says 

[[[[zi,£2], [2:3,2:4]], [[x 5 ,x 6 ], [x 7 ,x 8 ]]], [[[2:9,2:10], [2:11,2:12]], [[2:13.2:14], [xi5,2;i 6 ]]]] = 0. 

3. There exists a sequence of subspaces g := ii D i 2 D • • • D i„ — such each 
is an ideal in the preceding and such that the quotient ij/ij+i is abelian, 
i.e. C i J+ i. 

Proof of the equivalence of these conditions, [g, g] is always an ideal in 
g so the Dig form a sequence of ideals demanded by 3), and hence 1) => 3). We 
also have the obvious implications 3) 2) and 2) 1). So all these definitions 
are equivalent. 

Theorem 6 [Lie.] Let g be a solvable Lie algebra over an algebraically closed 
field k of characteristic zero, and (p, V) a finite dimensional representation of g. 
Then we can find a basis ofV so that p(g) consists of upper triangular matrices. 

By induction on dim V this reduces to 

Theorem 7 [Lie.] Under the same hypotheses, there exists a (non-zero) com- 
mon eigenvector v for all the p(y), i.e. there is a vector v e V and a function 
X ■ g — > k such that 

p(y) v = x(y) v V y £ g. (4.1) 

Lemma 2 Suppose that i is an ideal of g and (4-1) holds for all y £ i. Then 

X{[x,h]) = 0, Vxeg h£i. 

Proof of lemma. For x £ g let Vi be the subspace spanned by v, xv, . . . , x l ~ 1 v 
and let n > be minimal such that V n = V n +i- So V n is finite dimensional and 
xV n c V n . Also V n = V n+k Vfc. 



G4 



CHAPTER 4. ENGEL-LIE-CARTAN-WEYL 



Also, for ft € i, (dropping the p) we have: 

hv — x{h)v 

hxv — xhv — [x, h]v 

= x(h) xv mod Vi 

hx 2 v — xhxv + [ft, x]xv 

= x(h)x 2 v + uxv, mod V\ u <E I 

= x(h)x 2 v + x(u)xv mod V\ 

= x(h)x 2 v mod V 2 

hx l v = x{h)x 1 v mod Vi. 

Thus V n is invariant under i and for each h £ i, tr|y n ft = nx(h). In particular 
both a; and /i leave V n invariant and triy n [x, h] — since the trace of any 
commutator is zero. This proves the lemma. 

Proof of theorem by induction on dim g, which we may assume to be positive. 
Let m be any subspace of g with g D m D [g, g] . Then [g, m] C [g, g] C m so m 
is an ideal in g. In particular, we may choose m to be a subspace of codimension 
1 containing [g,g]. By induction we can find a v <G V and a \ : m — > k such 
that (4.1) holds for all elements of m. Let 

W := {w e V\hw = x{h)w V h G m}. 

If x e g, then 

/isw = x/iw — [x, ft] to = x(h)xw — xd^j = x(h)xw 

since x([x, ft]) = by the lemma. Thus W is stable under all of g. Pick 
x e g, x ^ m, and let w € be an eigenvector of x with eigenvalue A, say. 
Then v is a simultaneous eigenvector for all of g with \ extended as 

X (h + rx)=x(h) + rX. QED 

We had to divide by n in the above argument. In fact, the theorem is not 
true over a field of characteristic 2, with si (2) as a counterexample. 

Applied to the adjoint representation, Lie's theorem says that there is a flag 
of ideals with commutative quotients, and hence [g, g] is nilpotcnt. 

4.3 Linear algebra 

Let V be a finite dimensional vector space over an algebraically closed field of 
characteristic zero, and let 



det(T/ - u) = JJ(T - A*) 
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be the factorization of its characteristic polynomial where the Xi are distinct. 
Let S(T) be any polynomial satisfying 

S(T) = X, mod (T - Ai) m * , S(T) = mod T, 

which is possible by the Chinese remainder theorem. For each i let Vi :— the 
kernel of (u — \i) mi . Then V = ® Vi and on Vi, the operator S(u) is just the 
scalar operator Xil. In particular s — S(u) is semi-simple (its eigenvectors span 
V) and, since s is a polynomial in u it commutes with u. So 

u = s + n 

where 

n = N(u), N(T)=T-S{T) 

is nilpotent. Also 

ns — sn. 

We claim that these two elements arc uniquely determined by 

u = s + n, sn = ns, 

with ,s scmisimplc and n nilpotent. Indeed, since sn = ns, su = us so s(u — 
Xi) k = (u— Xi) k s so sVi C Vi. Since s — u is nilpotent, s has the same eigenvalues 
on as u does, i.e. A^. So s and hence n is uniquely determined. 

If P(T) is any polynomial with vanishing constant term, then if A C B 
are subspaces with uB C A then P(u)B C A. So, in particular, sB C A and 
nB C A. 

Define 

V^,, := V <g> V <g> • • • <g> ^ <g> V* <g> • • • <g> V* 

with p copies of V and q copies of V*. Let w e End(V) act on by — u* and 
on V pq by derivation, so , for example, 

Ui2 = u®l®l-1®M* 01-101® M*. 

Similarly, u n acts on V^i = V ® y* by 

u\\(x ® t) — ux <E> £ — x (g> u*£. 

Under the identification of V ®V* with End(U), the element x(&l acts on y € V 
by sending it into 

£(y)x. 

So the element Un(x <X> £) sends y to 

^(y)u(a:) - (u*£)(y)x = £{y)u{x) - £(u(y))x. 

This is the same as the commutator of the operator u with the operator (cor- 
responding to) x ® £ acting on y. In other words, under the identification of 
V ®V* with End(F), the linear transformation un gets identified with adu. 
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Proposition 2 If u = s + n is the decomposition of u then u pq = s pq + n pq is 
the decomposition of u pq . 

Proof. [s pqi n pq \ — and the tensor products of an eigenbasis for s is an 
eigenbasis for s pq . Also n pq is a sum of commuting nilpotents hence nilpotent. 
The map u i— > u pq is linear hence u pq — s pq + n pq . QED 

If (j) ■ k — > k is a map, we define <p(s) by <fi(s)\ v . — 4>(Xi). If we choose a 
polynomial such that P(0) = 0, P(\i) = <p{M) then P(u) — 4>(s). 

Proposition 3 Suppose that <f> is additive. Then 



Proof. Decompose V pq into a sum of tensor products of the Vi or V* . On each 
such space we have 



where the middle equation is just the additivity. QED 
As an immediate consequence we obtain 

Proposition 4 Notation as above. If A C B C V p _ q with u pq B C A then for 
any additive map, (f>(s) pq B C A 

Proposition 5 (over C) Let u = s + n as above. If tr(ucj)(s)) = for (f)(s) = s 
then u is nilpotent. 

Proof. tru(j)(s) = J2 m i^i^i = J2 m i\^i\ 2 - So the condition implies that all the 
A, 0. QED 

4.4 Cartan's criterion. 

Let g C End(V) be a Lie subalgebra where V is finite dimensional vector space 
over C. Then 



Proof. Suppose g is solvable. Choose a basis for which g is upper triangular. 
Then every y E [g,g] has zeros on the diagonal, Hence tr(xy) — 0. For the 
reverse implication, it is enough to show that [g,g] is nilpotent, and, by Engel, 
that each u & [g,g] is nilpotent. So it is enough to show that trus = 0, where 
s is the semisimple part of u, by Proposition 5 above. If it were true that s e g 
we would be done, but this need not be so. Write 



O0))p<? = HSpq)- 



H S P,q) = 4>( X H H ) 

= 0(AiJ +#..).. 

= (<l>(8))p,q 



g is solvable <^ tv(xy) = Va; e g, y G [g, g]. 
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Now for a, b, c G End(V) 



tr([a, 6]c) = 



tr(a6c — bac) 
ti(bca — bac) 
tr(6[c, a]) so 
^trQzi.j/ijs) 

^tr(y 4 [s, Si]) 



tr(us) = 



So it is enough to show that ads : g — > [g,g]. We know that adu : g — > [g, g], 
and we can, by Lagrange interpolation, find a polynomial P such that P(u) = s. 
The result now follows from Prop. 4: 

Since End(V) <~ Vi,i, take A = [g, g] and B = g. Then adu = so 
Ui.ig C [g,g[ and hence s M g C [g,g] or [s, x] G [g,g] Vx G g. QED 

4.5 Radical. 

If i is an ideal of g and g/i is solvable, then £)(") (g/i) = implies that £)(™)g C i. 
If i itself is solvable with £)( m )i = 0, then D( m +™)g = 0. So we have proved: 

Proposition 6 IfiCgis an ideal, and both i and g/i are solvable, so is g. 

If i and j are solvable ideals, then (i + j)/j ~ i/(i H j) is solvable, being the 
homomorphic image of a solvable algebra. So, by the previous proposition: 

Proposition 7Ifi and] are solvable ideals in g so is i+j. In particular, every 
Lie algebra g has a largest solvable ideal which contains all other solvable ideals. 
It is denoted by rad g or simply by r when g is fixed. 

An algebra g is called semi-simple if rad g = 0. Since Di is an ideal 
whenever i is (by Jacobi's identity), if r ^= then the last non-zero D^r is 
an abclian ideal. So an equivalent definition is: g is semi-simple if it has no 
non-zero abclian ideals. 

We shall call a Lie algebra simple if it is not abelian and if it has no proper 
ideals. We shall show in the next section that every semi-simple Lie algebra is 
the direct sum of simple Lie algebras in a unique way. 



Notice that if ( , ) is an invariant form, and i is an ideal, then i 1 - is again an 
ideal. 



4.6 The Killing form. 



A bilinear form ( , ) : g x g — > k is called invariant if 



([x,y],z) + (y,[x,z])=0 Vx,y,zGg. 



(4.2) 
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One way of producing invariant forms is from representations: if(p, V) is a 
representation of g, then 

(x,y) p := ti p(x)p(y) 

is invariant. Indeed, 

0,y],z) p + (y, [x,z]) p 



= to{{p{x)p{y) - P{y)p{x))p{z)) + p(y)(p(x)p(z) - p{z)p{x))} 
= te{p{x)p{y)p{z) - p(y)p(z)p(x)} 
= 0. 

In particular, if we take p = ad, V — g the corresponding bilinear form is 
called the Killing form and will be denoted by ( , ) K . We will also sometimes 
write n{x,y) instead of (x,y) K . 

Theorem 8 g is semi-simple if and only if its Killing form is non-degenerate. 

Proof. Suppose g is not semi-simple and so has a non-zero abelian ideal, a. 
We will show that (x,y) K = Vx e a, y e g. Indeed, let a — ad x ad y. Then 
a maps g — > a and a — > 0. Hence in terms of a basis starting with elements of 
a and extending, it (is upper triangular and) has along the diagonal. Hence 
tr<7 = 0. Hence if g is not scmisimplc then its Killing form is degenerate. 

Conversely, suppose that g is semi-simple. We wish to show that the Killing 
form is non-degenerate. So let u := g x = {x|trada:ady = Vy € g}. If 
x G u, z G g then 

tr{ad[x, z] ady} = tr{ad a; ad z ad y — adzadxady)} 
= tr{adx(ad0ady — ady adz)} 
= tradxad[z, y] 
= 0, 

so u is an ideal. In particular, tr u (ad x u ad y u ) = tr g (ad g x ad s y) for x, y € u, as 
can be seen from a block decomposition starting with a basis of u and extending 
to g. 

If we take y G Dm, we see that traduDadu = 0, so ad u is solvable by 
Cartan's criterion. But the kernel of the map u — > ad u is the center of u. So if 
ad u is solvable, so is u. QED 

Proposition 8 Let g be a semisimple algebra, i any ideal of g, and i 1 - its 
orthocomplement with respect to its Killing form. Then i fl i = 0. 

Indeed, i n i 1 - is an ideal on which tr ad x ad y = hence is solvable by Cartan's 
criterion. Since g is semi-simple, there are no non-trivial solvable ideals. QED 
Therefore 

Proposition 9 Every semi-simple Lie algebra is the direct sum of simple Lie 
algebras. 
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Proposition 10 Dg = g for a semi-simple Lie algebra. 
(Since this is true for each simple component.) 

Proposition 11 Let (f> : g — > s be a surjective homomorphism of a semi-simple 
Lie algebra onto a simple Lie algebra. Then if g = (J) gi is a decomposition of 
g into simple ideals, the restriction, fa of <p to each summand is zero, except for 
one summand where it is an isomorphism. 

Proof. Since s is simple, the image of every fa is or all of s. If fa is surjective 
for some i then it is an isomorphism since g^ is simple. There is at least one i 
for which it is surjective since <fi is surjective. On the other hand, it can not be 
surjective for for two ideals, gi, g^ i ^ j for then fagi, gj] = ^ [s, s] = s. QED 

4.7 Complete reducibility. 

The basic theorem is 

Theorem 9 [Weyl.] Every finite dimensional representation of a semi-simple 
Lie algebra is completely reducible. 

Proof. 

1. If p : g — > EndU is injectivc, then the form ( , ) p is non-degenerate. 
Indeed, the ideal consisting of all x such that (x, y) p = Vy € g is solvable 
by Cartan's criterion, hence 0. 

2. The Casimir operator. Let (ej) and (/j) be bases of g which are dual 
with respect to some non-degenerate invariant bilinear form, (,). So 
(ei,fj) = Sij. As the form is non-degenerate and invariant, it defines 
a map of 

g ® g i — > End g; x <£> y(w) = (y,w)x. 
This map is an isomorphism and is a g morphism. Under this map, 

fi(w) = y^(w, f l )e i = w 

by the definition of dual bases. Hence under the inverse map 

End g i— > g ® g 

the identity element, id, corresponds to e « ® fi ( an d so this expression 
is independent of the choice of dual bases). Since id is annihilated by 
commutator by any clement of End(g), we conclude that e « ® /* IS 
annihilated by the action of all (adx) 2 — ad a; ® 1 + 1 ® adx, x e g. 
Indeed, for x,e, f,y € g we have 

((ada;)2(e ® /)) y = (ad se ® / + e <g> ad x/) y 

= {f,y)[x,e] + ([x,f],y)e 

= (f,y)[x,e]-(f,[x,y])e by (4.2) 

= ((adx)(e®/) - (eO/)(adx))y. 
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Set 

C U(L). (4.3) 

i 

Thus C is the image of the element e i®/i under the multiplication map 
g®g i— > [/(g), and is independent of the choice of dual bases. Furthermore, 
C is annihilated by ad x acting on U (g) . In other words, it commutes with 
all elements of g, and hence with all of {7(g); it is in the center of (7(g). 

The C corresponding to the Killing form is called the Casimir element, 
its image in any representation is called the Casimir operator. 

3. Suppose that p : g — > Endy is injective. The (image of the) central 
element corresponding to ( . ) p defines an element of End V denoted by 
C p and 

trC p = trp(^ej/i) 

= tr^p(ei)p(/j) 

i 

= dim g 

With these preliminaries, we can state the main proposition: 

Proposition 12 Let O^V^W^k^O be an exact sequence of g modules, 
where g is semi-simple, and the action of g on k is trivial (as it must be). Then 
this sequence splits, i.e. there is a line in W supplementary to V on which g 
acts trivially. 

The proof of the proposition and of the theorem is almost identical to the proof 
we gave above for the special case of sl(2). We will need only one or two 
additional arguments. As in the case of si (2), the proposition is a special case 
of the theorem we want to prove. But we shall see that it is sufficient to prove 
the theorem. 

Proof of proposition. It is enough to prove the proposition for the case 
that V is an irreducible module. Indeed, if V\ is a submodule, then by induction 
on dim V we may assume the theorem is known for — > V/V\ — » W/V\ — > k — » 
so that there is a one dimensional invariant subspace M in W/V\ supplementary 
to V/V\ on which the action is trivial. Let be the inverse image of M in W . 
By another application of the proposition, this time to the sequence 

Q—*Vi—*N—*M—*Q 

we find an invariant line, P, in N complementary to V\. So N = V\ © P. Since 
(W/Vi) = (V/Vi) M we must have P n V = {0}. But since dim W = dim 
V + 1, we must have VF = V P. In other words P is a one dimensional 
subspace of W which is complementary to V. 
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Next we can reduce to proving the proposition for the case that g acts 
faithfully on V. Indeed, let i = the kernel of the action on V. For all x € g we 
have, by hypothesis, xW C V, and for x € i we have xV = 0. Hence Di acts 
trivially on W. But i = Di since i is semi-simple. Hence i acts trivially on W 
and we may pass to g/i. This quotient is again semi-simple, since i is a sum of 
some of the simple ideals of g. 

So we are reduced to the case that V is irreducible and the action, p, of g 
on V is injective. Then we have an invariant element C p whose image in End W 
must map W — > V since every element of g does. (We may assume that g^O.) 
On the other hand, C p ^ 0, indeed its trace is dim g. The restriction of C p to 
V can not have a non-trivial kernel, since this would be an invariant subspace. 
Hence the restriction of C p to V is an isomorphism. Hence ker C p : W — > V is 
an invariant line supplementary to V . We have proved the proposition. 

Proof of theorem from proposition. Let — > E' — > E be an exact 
sequence of g modules, and we may assume that E' ^ 0. We want to find an 
invariant complement to E' in E. Define W to be the subspace of Hom^E, E') 
whose restriction to E' is a scalar times the identity, and let V C W be the 
subspace consisting of those linear transformations whose restrictions to E' is 
zero. Each of these is a submodule of End(S). We get a sequence 

and hence a complementary line of invariant elements in W. In particular, we 
can find an element, T which is invariant, maps E — > E', and whose restriction 
to E 1 is non-zero. Then ker T is an invariant complementary subspace. QED 

As an illustration of construction of the Casimir operator consider g = sl{2) 
with 

h= (l -°i)' e= (o o)' /= (! o)' 

Then 

tr(ad/i) 2 =8 
tr(ade)(ad/) =4 

so the dual basis to the basis h, e, / is ft/8, //4, e/4, or, if we divide the metric 
by 4, the dual basis is ft/2, /, e and so the Casimir operator C is 

\h 2 + ef + fe= X -h 2 + h + 2fe. 
This coincides with the C that we used in Chapter II. 
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Chapter 5 



Conjugacy of Cartan 
subalgebras. 

It is a standard theorem in linear algebra that any unitary matrix can be di- 
agonalized (by conjugation by unitary matrices). On the other hand, it is easy 
to check that the subgroup T C U(n) consisting of all unitary matrices is a 
maximal commutative subgroup: any matrix which commutes with all diagonal 
unitary matrices must itself be diagonal; indeed if A is a diagonal matrix with 
distinct entries along the diagonal, any matrix which commutes with A must be 
diagonal. Notice that T is a product of circles, i.e. a torus. 

This theorem has an immediate generalization to compact Lie groups: Let 
G be a compact Lie group, and let T and T' be two maximal tori. (So T and T' 
are connected commutative subgroups (hence necessarily tori) and each is not 
strictly contained in a larger connected commutative subgroup). Then there 
exists an element a e G such that aT'a^ 1 = T. To prove this, choose one 
parameter subgroups of T and T' which are dense in each. That is, choose x 
and x' in the Lie algebra g of G such that the curve t i— > exp tx is dense in T 
and the curve t \— > expta' is dense in T". If we could find a £ G such that the 

a(expta')a _ = cxpi Ad a x' 

commute with all the expsx, then a(expte')a~ 1 would commute with all ele- 
ments of T, hence belong to T, and by continuity, aT'a^ 1 C T and hence = T. 
So we would like to find and a £ G such that 

[Ad a x',x] = 0. 

Put a positive definite scalar product ( , ) on g, the Lie algebra of G which is 
invariant under the adjoint action of G. This is always possible by choosing any 
positive definite scalar product and then averaging it over G. 
Choose a € G such that (Ad a x', x) is a maximum. Let 

y := Ad Q x' . 
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We wish to show that 

[y,x}=0. 

For any z£gwe have 

i[z,y],x) = ^(Ad cxpt ^y,x) |t=o = 
by the maximality. But 

{[z,y],x) = {z,[y,x}) 

by the invariancc of ( , ), hence [y,x] is orthogonal to all g hence 0. QED 

We want to give an algebraic proof of the analogue of this theorem for Lie 
algebras over the complex numbers. In contrast to the elementary proof given 
above for compact groups, the proof in the general Lie algebra case will be 
quite involved, and the flavor of the proof will by quite different for the solvable 
and semi-simple cases. Nevertheless, some of the ingredients of the above proof 
(choosing "generic elements" analogous to the choice of x and x' for example) 
will make their appearance. The proofs in this chapter follow Humphreys. 



5.1 Derivations. 

Let S be a derivation of the Lie algebra g. this means that 
5([y, z\) = [6{y),z] + [y, 5{z)\ V y, z E g. 

Then, for a, b E C 

(5 -a- b)[y,z] 
(5-a-b) 2 [y,z] 
(6-a-b) 3 [y,z] 



(6-a-b) n ly,z} 
Consequences: 

• Let g a = g a (<5) denote the generalized eigenspace corresponding to the 
eigenvalue a, so (5 — a) k = on g for large enough k. Then 

[go.gb] Cg [o+6] . (5.1) 

• Let s = s(S) denote the diagonizablc (semi-simple) part of S, so that 
s(5) = a on g a . Then, for y E g a , z E g b 

s(S) [y, z] = (a + b) [y, z] = [s(S)y, z] + [y, s(5)z] 

so s and hence also n — n(S), the nilpotent part of S are both derivations. 



= l(6-a)y,z] + [y,(5-b)z] 

- [(8 - a) 2 y, z] + 2[{5 - a)y, (S - b)z] + [y, (S - bfz] 
= [(5-a) 3 y ,z]+3[(S-a) 2 yi (S-by)] + 
3[(S-a)y,(5-bfz] + [y,(S-b) 3 z] 

= E( I )[(S-a) k y,(S-br- k z\. 
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• [6, ad a;] = ad(fe)]. Indeed, [5, ad a;] (it) — 5([x,u]) — [x, S(u)] = [S(x),u]. 
In particular, the space of inner derivations, Inn g is an ideal in Der g. 

• If g is scmisimplc then Inn g = Der g. Indeed, split off an invariant comple- 
ment to Inn g in Der g (possible by Weyl's theorem on complete reducibil- 
ity). For any 8 in this invariant complement, we must have [8, adx] = 
since [6, ad a;] — adSx. This says that Sx is in the center of g. Hence 
Sx = Vx hence 5 = 0. 

• Hence any x £ g can be uniquely written as x = s + n, s £ g, n £ g 
where ads is scmisimplc and adn is nilpotcnt. This is known as the 
decomposition into semi-simple and nilpotent parts for a semi-simple Lie 
algebra. 

• (Back to general g.) Let k be a subalgebra containing g (ada;) for some 
x £ g. Then x belongs g (adx) hence to k, hence ad a; preserves N g (k) 
(by Jacobi's identity). We have 

x £ g (ada;) C k C N s (k) C g 

all of these subspaces being invariant under ad a;. Therefore, the character- 
istic polynomial of ad a; restricted to N g (k) is a factor of the charactristic 
polynomial of ad x acting on g. But all the zeros of this characteristic 
polynomial are accounted for by the generalized zero eigenspace go (ad a:) 
which is a subspace of k. This means that ad a; acts on 7V g (k)/k without 
zero eigenvalue. 

On the other hand, adx acts trivially on this quotient space since i£k 
and hence [Ngk, x] C k by the definition of the normalizer. Hence 

N s (k) = k. (5.2) 

We now come to the key lemma. 

Lemma 3 Let k C g be a subalgebra. Let z £ k be such that go(adz) does not 
strictly contain any go(ada;), i£k. Suppose that 

k C g (ad2;). 

Then 

go(adz) C go(ady) V y £ k. 

Proof. Choose z as in the lemma, and let x be an arbitrary clement of k. 
By hypothesis, x £ go(adz) and we know that [go (ad z), go (ad z)] C go(adz). 
Therefore [x, go(ad z)\ C go(adz) and hence 

ad(z + ca;)g (adz) Cgo(ad2:) 

for all constants c. Thus ad(z + cx) acts on the quotient space g/go(adz). We 
can factor the characteristic polynomial of ad(z + cx) acting on g as 



(T) = f(T,c)g(T,c) 
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where / is the characteristic polynomial of ad(z + cx) on go (adz) and g is the 
characteristic polynomial of ad(z + cx) on g/go(adz). Write 

f(T,c) = T r + h{c)T r - 1 + ■ ■ ■ f r (c) r = dimg (adz) 
g(T,c) = T n ~ r + 9l {c)T n - r - 1 + ..- + , g„_ r (c) n = dimg. 

The fi and the gi are polynomials of degree at most i in c. Since is not an 
eigenvalue of adz on g/g (adz), we see that g n - r (0) ^ 0. So we can find r + 1 
values of c for which g n _ r (c) ^ 0, and hence for these values, 

g (ad(z + cx)) Cg (adz). 

By the minimality this forces 

g (ad(z + cx)) = g (adz) 

for these values of c. This means that f(T, c) = T r for these values of c, so each 
of the polynomials /i, . . . , f r has r + 1 distinct roots, and hence is identically 
zero. Hence 

g (ad(z + cx)) D go (adz) 
for all c. Take c = l,x = y — z to conclude the truth of the lemma. 

5.2 Cartan subalgebras. 

A Cartan subalgebra (CSA) is defined to be a nilpotent subalgebra which is its 
own normalizer. A Borcl subalgebra (BSA) is defined to be a maximal solvable 
subalgebra. The goal is to prove 

Theorem 10 Any two CSA 's are conjugate. Any two BSA 's are conjugate. 

Here the word conjugate means the following: Define 

■Af(g) = i x \ 3y G g, a^O, with x e g a (ady)}. 

Notice that every clement of Af(g) is ad nilpotent and that Af(g) is stable 
under Aut(g). As any x G Af(g) is nilpotent, exp adx is well defined as an 
automorphism of g, and we let 

denote the group generated by these elements. It is a normal subgroup of the 
group of automorphisms. Conjugacy means that there is a 4> G £(g) with 
0(hi) = h 2 where hi and h 2 are CSA's. Similarly for BSA's. 

As a first step we give an alternative characterization of a CSA. 

Proposition 13 h is a CSA if and only if h = go(adz) where g (adz) con- 
tains no proper subalgebra of the form g (adx). 
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Proof. Suppose h = go (ad z) which is minimal in the sense of the proposition. 
Then we know by (5.2) that h is its own normalizer. Also, by the lemma, 
h C go(ada;) Vx e h. Hence adx acts nilpotcntly on h for all x <E h. Hence, by 
Engel's theorem, h is nilpotent and hence is a CSA. 

Suppose that h is a CSA. Since h is nilpotent, we have h C go(adx), Vie 
h. Choose a minimal z. By the lemma, 

go(adz) C go(ada;) Vieh. 

Thus h acts nilpotcntly on go(adz)/h. If this space were not zero, we could 
find a non-zero common eigenvector with eigenvalue zero by Engel's theorem. 
This means that there is a y ^ h with [y, h]ch contradicting the fact h is its 
own normalizer. QED 

Lemma 4 // : g — > g' is a surjective homomorphism and h is a CSA of g 
then 0(h) is a CSA o/g'. 

Clearly 0(h) is nilpotent. Let k = Ker and identify g' = g/k so 0(h) = h + k. 
If x + k normalizes h + k then x normalizes h + k. But h = go (ad z) for some 
minimal such z, and as an algebra containing a g (ad z), h+k is self-normalizing. 
So x e h + k. QED 

Lemma 5 : g — > g' be surjective, as above, and h' a CSA o/g'. Any CSA 
h o/m — ^-^h') is a CSA o/g. 

h is nilpotent by assumption. We must show it is its own normalizer in g. By 
the preceding lemma, 0(h) is a Cartan subalgebra of h'. But 0(h) is nilpotent 
and hence would have a common eigenvector with eigenvalue zero in h'/0(h), 
contradicting the selfnormalizing property of 0(h) unless 0(h) — h'. So 0(h) = 
h'. If x € g normalizes h, then <fr(x) normalizes h'. Hence 0(x) <G h' so x € m 
so x <E h. QED 

5.3 Solvable case. 

In this case a Borel subalgebra is all of g so we must prove conjugacy for CSA's. 
In case g is nilpotent, we know that any CSA is all of g, since g = go(adz) for 
any z£g. So we may proceed by induction on dim g. Let hi and h.2 be Cartan 
subalgebras of g. We want to show that they are conjugate. Choose an abelian 
ideal a of smallest possible positive dimension and let g' — g/a. By Lemma 4 
the images h^ and h' 2 of hi and h.2 in g' are CSA's of g' and hence there is 
a a' e £(g') with a'(h[) — h' 2 . We claim that we can lift this to a, a € £(g)- 
That is, we claim 

Lemma 6 Let : g — > g' be a surjective homomorphism. If a' e £(g') then 
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there exists a a £ £(g) such that the diagram 




commutes. 

Proof of lemma. It is enough to prove this on generators. Suppose that 
x ' € ga(y') and choose y £ g, <f>(y) = y' so </>(g a (y)) = ga(y')> and nence we 
can find anie A/"(g) mapping on to x' . Then cxpadx is the desired a in the 
above diagram if a' = expadx'. QED 

Back to the proof of the conjugacy theorem in the solvable case. Let mi := 
_1 (h' 1 ), m 2 := (/^(hj). We have a a with a (mi) — m 2 so <r(hi) and h 2 are 
both CSA's of m 2 . If m 2 /gwe are done by induction. So the one new case 
is where 

g = a + hi = a + h 2 . 

Write 

h 2 = go (adz) 

for some x £ g. Since a is an ideal, it is stable under ad a; and we can split it 
into its and non-zero generalized eigenspaces: 

a = a (adx) © a* (ad a;). 

Since a is abelian, ad of every clement of a acts trivially on each summand, and 
since h 2 = g (adx) and a is an ideal, this decomposition is stable under h 2 , 
hence under all of g. By our choice of a as a minimal abelian ideal, one or the 
other of these summands must vanish. If a = ao (ad x) we would have a C h 2 
so g = h 2 and g is nilpotcnt. There is nothing to prove. So the only case to 
consider is a = a* (ad a;). Since h 2 = g (adx) we have 

a = g*(ada;). 

Since g = hi + a, write 

x = y + z, y £ hi, z £ g*(adx). 

Since adx is invertiblc on g*(ada;), write z = [x,z'], z' £ a* (ad a;). Since a is 
an abelian ideal, (adz') 2 = 0, so exp(adz') = 1 + adz'. So 

exp(ad z')x = x — z = y. 

So h := go(ady) is a CSA (of g), and since y £h 1 we have hi C go(ady) = h 
and hence hi = h. So expadz' conjugates h 2 into hi. Writing z' as sum of 
its generalized eigencomponents, and using the fact that all the elements of a 
commute, we can write the exponential as a product of the exponentials of the 
summands. QED 
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5.4 Toral subalgebras and Cartan subalgebras. 

The strategy is now to show that any two BSA's of an arbitrary Lie algebra are 
conjugate. Any CSA is nilpotcnt, hence solvable, hence contained in a BSA. 
This reduces the proof of the conjugacy theorem for CSA's to that of BSA's 
as we know the conjugacy of CSA's in a solvable algebra. Since the radical is 
contained in any BSA, it is enough to prove this theorem for semi-simple Lie 
algebras. So for this section the Lie algebra g will be assumed to be semi-simple. 

Since g does not consist entirely of ad nilpotent elements, it contains some x 
which is not ad nilpotent, and the semi-simple part of x is a non-zero ad semi- 
simple element of g. A subalgcbra consisting entirely of semi-simple elements 
is called toral, for example, the line through x s . 

Lemma 7 Any toral subalgebra t is abelian. 

Proof. The elements ad a;, x £ t can be each be diagonalized. We must 
show that ad a; has no eigenvectors with non-zero eigenvalues in t. Let y be an 
eigenvector so [x,y] = ay. Then (&dy)x = —ay is a zero eigenvector of ady, 
which is impossible unless ay = 0, since ad y annihilates all its zero eigenvectors 
and is invertible on the subspace spanned by the eigenvectors corresponding to 
non-zero eigenvalues. QED 

One of the consequences of the considerations in this section will be: 

Theorem 11 A subalgebra h of a semi-simple Lie algebra gis a CSA if and 
only if it is a maximal toral subalgebra. 

To prove this we want to develop some of the theory of roots. So fix a 
maximal toral subalgebra h. Decompose g into simultaneous eigenspaces 

g = C g (h)0g a (h) 

where 

C g (h) := {x e g|[ft, a;] = V ft G h} 
is the centralizer of h, where a ranges over non-zero linear functions on h and 

g«(h) := {x £ g| [ft., ar] = a(h)x V ft £ h}. 

As h will be fixed for most of the discussion, we will drop the (h) and write 

g = go © g Q 

where go = C g (h). We have 

• [ga,g^] C g a+ i3 (by Jacobi) so 

• ad a; is nilpotent if x £ g a , a^O 

• If a + (3 ^ then n(x, y) = V x £ g a , y £ gp. 
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The last item follows by choosing an h £ h with a(h) + (3(h) ^ 0. Then 
= n([h, x], y) + k(x, [h, y]) = (a(h) + f3(h))n(x, y) so k(x, y) — 0. This implies 
that go is orthogonal to all the g Q , a^O and hence the non-degeneracy of k 
implies that 

Proposition 14 The restriction of n to g x g is non-degenerate. 

Our next intermediate step is to prove: 
Proposition 15 

h = g (5.3) 

if h is a maximal toral subalgebra. 

Proceed according to the following steps: 

X £ g => x s £ g X n £ g . (5.4) 

Indeed, x £ go adx : h — * 0, and then ada; s , adx„ also map h — > 0. 

x £ go, x semisimplc ^ieh. (5.5) 

Indeed, such an x commutes with all of h. As the sum of commuting semi- 
simple transformations is again semisimple, we conclude that h + Cx is a toral 
subalgebra. By maximality it must coincide with h. 
We now show that 

Lemma 8 The restriction of the Killing form k to h x h is non-degenerate. 

So suppose that n(h, x) — V x £ h. This means that n(h, x) — V semi- 
simple x £ g . Suppose that n £ go is nilpotent. Since h commutes with n, 
(ad/i)(adn) is again nilpotent. Hence has trace zero. Hence n(h,n) = 0, and 
therefore n(h, x) = V x £ go- Hence h = 0. QED 
Next observe that 

Lemma 9 g is a nilpotent Lie algebra. 

Indeed, all semi-simple elements of g commute with all of go since they belong 
to h, and a nilpotent clement is ad nilpotent on all of g so certainly on go- 
Finally any x £ go can be written as a sum x s + x n of commuting elements 
which are ad nilpotent on g , hence x is. Thus go consists entirely of ad nilpotent 
elements and hence is nilpotent by Engel's theorem. QED 
Now suppose that h £ h, x,y £ go- Then 

K(h,[x,y\) = K([h,x],y) 

= k(0,v) 
= 

and hence, by the non-degeneracy of k on h, we conclude that 



5.5. ROOTS. 



81 



Lemma 10 

hn [ g0 , g ] = o. 

We next prove 

Lemma 11 go is abelian. 

Suppose that [go, go] 0- Since go is nilpotcnt, it has a non-zero center con- 
tained in [go, go]- Choose a non-zero element z £ [go, go] m this center. It can 
not be semi-simple for then it would lie in h. So it has a non-zero nilpotent part, 
n, which also must lie in the center of go, by the B C A theorem we proved in 
our section on linear algebra. But then ad n ad a; is nilpotent for any x £ go 
since [x,n] = 0. This implies that n(n,g ) = which is impossible. QED 
Completion of proof of (5.3). We know that go is abelian. But then, if 
h 7^ go, w e would find a non-zero nilpotcnt clement in go which commutes with 
all of go (proven to be commutative). Hence n(n,g ) = which is impossible. 
This completes the proof of (5.3). QED 
So we have the decomposition 

g = h®0g Q 

which shows that any maximal toral subalgebra h is a CSA. 

Conversely, suppose that h is a CSA. For any x = x s + x n £ g, g (adx s ) C 
go (ad x) since x n is an ad nilpotcnt clement commuting with ad x s . If we choose 
x £ h minimal so that h = go (ad a;), we see that we may replace x by x s 
and write h = go(adx s ). But go(ada; s ) contains some maximal toral algebra 
containing x s , which is then a Cartan subalgebra contained in h and hence must 
coincide with h. This completes the proof of the theorem. QED 

5.5 Roots. 

We have proved that the restriction of k to h is non-degenerate. This allows us 
to associate to every linear function on h the unique element t<f, £ h given by 

4>(h) = /c(fy, h). 

The set of a £ h*, a^O for which g a ^ is called the set of roots and is 
denoted by <&. We have 

• $ spans h* for otherwise 3h ^ : a(h) = Va £ $ implying that 
[ft,g„] = 0Va so [h,g] =0. 

• a £ <f> => —a £ <f> for otherwise g a _L g. 

• x £g a ,y £ g- a ,a £$=> [x,y] = n(x,y)t a . Indeed, 

K(h,[x,y\) = K([h,x],y) 

= K(t a ,h)n(x,y) 
= K(K(x,y)t a ,h). 
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• [ga> g—a] is one dimensional with basis t a . This follows from the preceding 
and the fact that g Q can not be perpendicular to g_ Q since otherwise it 
will be orthogonal to all of g. 

• a(t a ) = n(t a , t a ) ^ 0. Otherwise, choosing x £ g a , y e g_ Q with k(x, y) = 
1, we get 

[x,y] = t a , [t a ,x] = [t a ,y] = 0. 

So x, y, t a span a solvable three dimensional algebra. Acting as ad on g, 
it is superdiagonizable, by Lie's theorem, and hence ad t a , which is in the 
commutator algebra of this subalgebra is nilpotent. Since it is ad semi- 
simple by definition of h, it must lie in the center, which is impossible. 

• Choose e a e g a ,fa € g-a with 

K(e a ,f a ) = — — . 

Set 

2 

: ~ ^77 T\ ta ' 

Then e a , f a , h a span a subalgebra isomorphic to sZ(2). Call it sl(2) a . We 
shall soon see that this notation is justified, i.e that g a is one dimensional 
and hence that sl(2) a is well defined, independent of any "choices" of 
e Q , f a but depends only on a. 

• Consider the action of sl(2) a on the subalgebra m := h®@ g na where n £ 
Z. The zero eigenvectors of h a consist of h C m. One of these corresponds 
to the adjoint representation of sl{2) a C m. The orthocomplemcnt of 
h a € h gives dim h— 1 trivial representations of sl(2) a . This must exhaust 
all the even maximal weight representations, as we have accounted for all 
the zero weights of sl(2) a acting on g. In particular, dim g a = 1 and no 
integer multiple of a other than —a is a root. Now consider the subalgebra 
p := h ® ©g cct , c e C. This is a module for sl(2) a . Hence all such c's 
must be multiples of 1/2. But 1/2 can not occur, since the double of a 
root is not a root. Hence the ±a are the only multiples of a which are 
roots. 

Now consider (3 £ <&, (i ^ ±a. Let 

k : = 08/3+ja- 

Each non-zero summand is one dimensional, and k is an sl(2) a module. Also 
[3 + ia for any i, and evaluation on h a gives f3(h a ) + 2i. All weights differ 
by multiples of 2 and so k is irreducible. Let q be the maximal integer so that 
j3 + qa E <f>, and r the maximal integer so that /3 — ra 6 Then the entire 
string 

(3 — ra, [3 — (r — l)a, . . . [3 + qa 
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are roots, and 

0(h a )-2r=-(0(h a ) + 2q) 

or 

0(h a ) = r - q e Z. 

These integers are called the Cartan integers. 

We can transfer the bilinear form k from h to h* by defining 

(7,5) = n(t 7 ,t s ). 

So 

0(h a ) = n{tp,h a ) 
(a, a) ' 

So 

(a, a) 

Choose a basis ai,...,ag of h* consisting of roots. This is possible because 
the roots span h*. Any root /3 can be written uniquely as linear combination 

(3 = aai H h c e ae 

where the Cj are complex numbers. We claim that in fact the Cj are rational 
numbers. Indeed, taking the scalar product relative to ( , ) of this equation 
with the oii gives the i equations 

(0,oti) = ci(a\,ai) + h cg(ae,ai). 

Multiplying the i-th equation by 2/(a,-,aj) gives a set of t equations for the i 
coefficients q where all the coefficients are rational numbers as are the left hand 
sides. Solving these equations for the c» shows that the Ci are rational. 

Let E be the real vector space spanned by the Then ( , ) restricts 

to a real scalar product on E. Also, for any A ^ e E, 

(A, A) :=: K{t x ,t x ) 
:= tr(adt A ) 2 

> 0. 

So the scalar product ( , ) on E is positive definite. E is a Euclidean space. 

In the string of roots, /3 is q steps down from the top, so q steps up from the 
bottom is also a root, so 

/3 — (r — q)ot 
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is a root, or 

{a, a) 

But 

2(/3,g) 

P- 7 r« = s a (P) 

(a, a) 

where s a denotes Euclidean reflection in the hypcrplanc perpendicular to a. In 
other words, for every a £ $ 

s a : $ -> (5.6) 

The subgroup of the orthogonal group of E generated by these reflections 
is called the Weyl group and is denoted by W. We have thus associated to 
every semi-simple Lie algebra, and to every choice of Cartan subalgebra a finite 
subgroup of the orthogonal group generated by reflections. (This subgroup 
is finite, because all the generating reflections, s a , and hence the group they 
generate, preserve the finite set of all roots, which span the space.) Once we 
will have completed the proof of the conjugacy theorem for Cartan subalgebras 
of a semi-simple algebra, then we will know that the Weyl group is determined, 
up to isomorphism, by the semi-simple algebra, and does not depend on the 
choice of Cartan subalgebra. 



We define 

(f3,a) 



2(13, a) 
(a, a) ' 



So 



(M = (3(h a ) (5.7) 

= r-qeZ (5.8) 

and 

s a {0)=0-{l3,a)a. (5.9) 

So far, we have defined the reflection s a purely in terms of the root struction 
on E, which is the real subspace of h* generated by the roots. But in fact, 
s a , and hence the entire Weyl group arises as (an) automorphism (s) of g which 
preserve h. Indeed, we know that e a , f a , h a span a subalgebra sl(2) a isomorphic 
to sl(2). Now expade Q and expad(— f a ) are elements of £(g). Consider 

r a := (expadeo,)(expad(— / a ))(expade Q ) e £(g). (5.10) 

We claim that 

Proposition 16 The automorphism r a preserves h and on h it is given by 

r a (h) = h - a(h)h a . (5.11) 
In particular, the transformation induced by r a on E is s a . 
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Proof. It suffices to prove (5.11). If a(h) = 0, then both ad e a and ad f a vanish 
on h so T a {h) = h and (5.11) is true. Now h a and ker a span h. So we need 
only check (5.11) for h a where it says that r(h a ) = —h a . But we have already 
verified this for the algebra sl(2). QED 

We can also verify (5.11) directly. We have 

exp(ade a )(/i) = h — a(h)e a 

for any h £ h. Now [/„, e a ] — —h a so 

(ad/ Q ) 2 (e Q ) = [f a , -h a ] = [h a ,f a ] = -2f a . 

So 

cxp(-ad/ a )(expade Q )/i = (id - ad f a + i(ad f a ) 2 )(h - a(h)e a ) 

= h — a(h)e a — a(h)f a — a(h)h a + a(h)f a 
= h — a(h)h a — a(h)e a . 

If we now apply exp ad e a to this last expression and use the fact that a(h a ) = 2, 
we get the right hand side of (5.11). 

5.6 Bases. 

A C $ is a called a Base if it is a basis of E (so #A = £ = dimnE = dinich) 
and every (3 <G $ can be written as J2 a eA k a ce, k a £ Z with either all the 
coefficients k a > or all < 0. Roots are accordingly called positive or negative 
and we define the height of a root by 



ht/3:=^fc Q . 



Given a base, we get partial order on E by defining A^^iiffA — ^iisa sum of 
positive roots or zero. We have 

{a,P)<0, a,/3eA (5.12) 

since otherwise (a, (3) > and 

s a (P) = P - -, r« 

(a, a) 

is a root with the coefficient of (3 = 1 > and the coefficient of a < 0, contradict- 
ing the definition which says that roots must have all coefficients non-negative 
or non-positive. 

To construct a base, choose a 7 e E, | (7, /?) ^ V (3 £ <&. Such an element 
is called regular. Then every root has positive or negative scalar product with 
7, dividing the set of roots into two subsets: 

$ = $+U$~ 
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A root (3 £ $ + is called decomposable if (3 = ft + ft , ft , ft € ( f )+ , indecom- 
posable otherwise. Let A (7) consist of the indecomposable elements of <I> + (7). 

Theorem 12 A(7) is a base, and every base is of the form A(7) for some 7. 

Proof. Every (3 £ <& + can be written as a non-negative integer combination of 
A(7) for otherwise choose one that can not be so written with (7, (3) as small 
as possible. In particular, (3 is not indecomposable. Write (3 = ft + ft, ft € 
Then /? g A( 7 ),( 7 ,ft = (7, ft) + (7, ft) and hence (7, ft) < (7, ft and 
(7, ft) < (7,/?). By our choice of [3 this means ft and ft are non-negative 
integer combinations of elements of A (7) and hence so is ft contradiction. 

Now (5.12) holds for A = A (7) for if not, a — (3 is a root, so either a — (3 £ $ + 
soa = a- (3 + (3 is decomposable or (3 — a £ <& + and /3 is decomposable. 

This implies that A(7) is linearly independent: for suppose J2 a Ca0e = an( ^ 
let p Q be the positive coefficients and — qp the negative ones, so 

^2? a a = ^2q fJ f3 

a 

all coefficients positive. Let e be this common vector. Then (e, e) = ^2p a q(3(a, (3) < 

so e — which is impossible unless all the coefficients vanish, since all scalar 
products with 7 are strictly positive. Since the elements of <& span E this shows 
that A(7) is a basis of E and hence a base. 

Now let us show that every base is of the desired form: For any base A, 
let <i> + = $ + (A) denote the set of those roots which are non-negative integral 
combinations of the elements of A and let <I> - = <!> - (A) denote the ones which 
are non-positive integral combinations of elements of A. Define S ai a £ A to 
be the projection of a onto the orthogonal complement of the space spanned by 
the other elements of the base. Then 

(5 a , a') = 0, a ^ a', (5 a , a) = (S a , S a ) > 

so 7 — S r a°~a, r a > satisfies 

(7, a) > V a e A 

hence 

*+(A) C $+(7) 

and 

$-(A) c $-(7) 

hence 

$ + (A) = $+(7) and $-(A) = $-(7). 

Since every element of <i> + can be written as a sum of elements of A with non- 
negative integer coefficients, the only indecomposable elements can be the A, 
so A (7) C A but then they must be equal since they have the same cardinality 

1 = dim E. QED 



5.7. WEYL CHAMBERS. 
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5.7 Weyl chambers. 

Define Pp := (3- 1 . Then E — (J Pp is the union of Weyl chambers each con- 
sisting of regular 7's with the same <I> + . So the Weyl chambers are in one to 
one correspondence with the bases, and the Weyl group permutes them. 

Fix a base, A. Our goal in this section is to prove that the reflections 
s a , a € A generate the Weyl group, W, and that W acts simply transitively 
on the Weyl chambers. 

Each s a , a G A sends a ^ —a. But acting on A = ^ Cpf3, the reflection s a 
does not change the coefficient of any other element of the base. If A e <f> + and 
A 7^ a, we must have cp > for some (3 ^ a in the base A. Then the coefficient 
of (3 in the expansion of s a (A) is positive, and hence all its coefficients must be 
non-negative. So s a (X) e <f>+. In short, the only element of <I> + sent into <I> - is 
a. So if 

6 := - ^ (3 then s a S = 5 — a. 

If p e <l> + , (3 g A, then we can not have a) < Va e A for then /3 U A 
would be linearly independent. So /? — a is a root for some a <G A, and since we 
have changed only one coefficient, it must be a positive root. Hence any (3 e <f> 
can be written as 

/? = a\ H ha p ci,eA 

where all the partial sums are positive roots. 

Let 7 be any vector in a Euclidean space, and let s 7 denote reflection in the 
hyperplane orthogonal to 7. Let R be any orthogonal transformation. Then 

s fl7 = Rs^RT 1 (5.13) 

as follows immediately from the definition. 

Let ai,...,Qi £ A, and, for short, let us write Si := s a .. 

Lemma 12 If si ■ ■ ■ Si-ia^ < then 3j < i,j > 1 so that 

Si ■ ■ ■ Si = Si ■ ■ ■ Sj-is 1+1 ■ ■ ■ Sj_i. 

Proof. Set (3i-i :— a i: (3j := Sj + i ■ ■ ■ Sj-ia^, j < i — 1. Since /3j_i e <f>+ 
and (3q e <1>~ there must be some j for which f3j € $ + and Sj/?j = (3j-\ € 
implying that that /3j = so by (5.13) with R = Sj + \ ■ ■ ■ Sj_i we conclude that 

s j = ( s j+i • • • s i~i) s i( s j+i ■ ■ ■ Si-iy 1 

or 

S j S j + l ' ' ' S i ~ S J + 1 ' ' ' S «-l 

implying the lemma. QED 

As a consequence, if s — si ■ ■ ■ s t is a shortest expression for s, then, since 
s t a t € we must have sa 4 G <f> _ . 
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Keeping A fixed in the ensuing discussion, we will call the elments of A sim- 
ple roots, and the corresponding reflections simple reflections. Let W denote 
the subgroup of W generated by the simple reflections, s a , a <G A. (Eventually 
we will prove that this is all of W.) It now follows that if s £ W and sA = A 
then s = id. Indeed, if s ^ id, write s in a minimal fashion as a product of 
simple reflections. By what we have just proved, it must send some simple root 
into a negative root. So W' permutes the Weyl chambers without fixed points. 
We now show that W acts transitively on the Weyl chambers: 

Let 7 <E E be a regular element. We claim 

3 seW with (s(7),a) > V a e A. 

Indeed, choose s G W so that (s(j),S) is as large as possible. Then 

(s(l),S) > (s a s(j),5) 
= (s(l),s a 5) 

= (s(7),<*) - (s(7),a) so 
(s(7),a) > V a e A. 

We can't have equality in this last inequality since 5(7) is not orthogonal to any 
root. This proves that W acts transitively on all Weyl chambers and hence on 
all bases. 

We next claim that every root belongs to at least one base. Choose a (non- 
regular) 7' _L a, but 7' ^ Pp, (i ^ a. Then choose 7 close enough to 7' so that 
(7, a) > and (7, a) < |(7, (3) | V (3 ^ a. Then in ( f )+ (7) the clement a must be 
indecomposable. If (3 is any root, we have shown that there is an s' <G W with 
s'(3 — on e A. By (5.13) this implies that every reflection s^in W is conjugate 
by an element of W' to a simple reflection: sp = s'sis'" 1 S W' . Since W is 
generated by the sp, this shows that W — W . 



5.8 Length. 

Define the length of an element of W as the minimal word length in its expression 
as a product of simple roots. Define n(s) to be the number of positive roots 
made negative by s. We know that n(s) = £(s) if £(s) = or 1. We claim that 

£(s) = n {s) 

in general. 

Proof by induction on £(s). Write s = s\ ■ ■ ■ in reduced form and let 
a = on. We have sa € Q~ . Then n(ssi) = n(s) — 1 since Si leaves all positive 
roots positive except a. Also £(ssi) = £(s) — 1. So apply induction. QED 

Let C = C(A) be the Weyl chamber associated to the base A. Let C denote 
its closure. 

Lemma 13 //A,fi € C and s <£ W satisfies sX = \i then s is a product of 
simple reflections which fix X. In particular, X = [i. So C is a fundamental 
domain for the action of W on E. 
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Proof. By induction on l(s). If £(s) = then s = id and the assertion is clear 
with the empty product. So we may assume that n(s) > 0, so s sends some 
positive root to a negative root, and hence must send some simple root to a 
negative root. So let a € A be such that sa £ <i>~. Since [i £ C, we have 
(fj,, 0) > 0, V/3 £ and hence (fi, sa) < 0. So 

> (fi, sa) 
= (s"V,Q!) 
= (A, a) 
> 0. 

So (A, a) = so s a X = A and hence ss a X = /i. But n(ss a ) — n(s) — 1 since 
s a = —a and s a permutes all the other positive roots. So £(ss a ) — £(s) — 1 
and we can apply induction to conclude that s — (ss a )s a is a product of simple 
reflections which fix A. 

5.9 Conjugacy of Borel subalgebras 

We need to prove this for semi-simple algebras since the radical is contained in 
every maximal solvable subalgebra. 

Define a standard Borel subalgebra (relative to a choice of CSA h and a 
system of simple roots, A) to be 

b(A):=h® Sp . 

/3e$+(A) 

Define the corresponding nilpotent Lie algebra by 

n + (A) := ep . 

Sincc each s a can be realized as (exp e Q )(exp — / Q )(exp e a ) every element of W 
can be realized as an element of £(g). Hence all standard Borel subalgebras 
relative to a given Cartan subalgebra are conjugate. 

Notice that if x normalizes a Borel subalgebra, b, then 

[b + Cx, b + Cx] c b 

and so b + Cx is a solvable subalgebra containing b and hence must coincide 
with b: 

N s (b) = b. 

In particular, if x £ b then its semi-simple and nilpotent parts lie in b. 

From now on, fix a standard BSA, b. We want to prove that any other BSA, 
b' is conjugate to b. We may assume that the theorem is known for Lie algebras 
of smaller dimension, or for b' with bflb' of greater dimension, since if dim 
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b n b' = dim b, so that b' D b, we must have b' = b by maximality. Therefore 
we can proceed by downward induction on the dimension of the intersection 
bnb'. 

Suppose bnb' ^ 0. Let n' be the set of nilpotcnt elements in b n b'. So 
n' = n+ n b'. 

Also [b n b', b n b'] C n+ n b' = n' so n' is a nilpotent ideal in bnb'. 
Suppose that n' ^ 0. Then since g contains no solvable ideals, 

k := JVg(n') + g. 

Consider the action of n' on b/(bnb'). By Engel, there exists ay ^ bnb' with 
[x,y] £ b n b' Vx £ n'. But [x,y] £ [b,b] C n+ and so [x,y] £ n'. So y £ k. 
Thus t/Gknb, y^bnb'. Similarly, we can interchange the roles of b and 
b' in the above argument, replacing n + by the nilpotcnt subalgcbra [b',b'] of 
b', to conclude that there exists a y' £ k n b', y' £ b n b'. In other words, the 
inclusions 

knbDbnb', knb'Dbnb' 

are strict. 

Both b n k and b'nk are solvable subalgebras of k. Let c, c' be BSA's 
containing them. By induction, there is a a £ £ (k) C £(g) with ct(c') = c. Now 
let b" be a BSA containing c. We have 

b"nbDcnbDknbDb'nb 

with the last inclusion strict. So by induction there is a r £ £(g) with r(b") = b. 
Hence 

tct(c') C b. 

Then 

b n Tcr(b') D tct(c') n Tcr(b') D Tcr(b' n k) D To{h n b') 

with the last inclusion strict. So by induction we can further conjugate rah' 
into b. 

So we must now deal with the case that n' = 0, but we will still assume 
that bnb' ^ 0. Since any Borel subalgebra contains both the semi-simple and 
nilpotcnt parts of any of its elements, we conclude that bnb' consists entirely 
of semi-simple elements, and so is a toral subalgebra, call it t. If x £ b, t £ t = 
bnb' and [x,t] £ t, then we must have [x,t] =0, since all elements of [b, b] are 
nilpotent. So 

JVb(t) = C b (t). 

Let c be a CSA of Cb(t). Since a Cartan subalgebra is its own normalizer, we 
have tec. So we have 

t C c C C b (t) = JV b (t) C JVb(c). 

Let t £ t, n £ M>(c). Then [t, n] £ c and successive brackets by t will eventually 
yield 0, since c is nilpotent. Thus (ad t) k n = for some k, and since t is semi- 
simple, [t, n] — 0. Thus n £ C b (t) and hence n £ c since c is its own normalizer 
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in Cb(t). Thus c is a CSA of b. We can now apply the conjugacy theorem for 
CSA's of solvable algebras to conjugate c into h. 

So we may assume from now on that tch. If t = h, then decomposing 
b' into root spaces under h, we find that the non-zero root spaces must consist 
entirely of negative roots, and there must be at least one such, since b' ^ h. 
But then we can find a r a which conjugates this into a positive root, preserving 
h, and then T a (h') n b has larger dimension and we can further conjugate into 
b. 

So we may assume that 

tch 

is strict. 
If 

b' C C s (t) 

then since we also have h C C g (t), we can find a BSA, b" of C g (t) containing h, 
and conjugate b' to b", since we are assuming that t ^ and hence C g (t) ^ g. 
Since b"nb D h has bigger dimension than b' fl b, we can further conjugate to 
b by the induction hypothesis. 
If 

b' t c g (t) 

then there is a common non-zero eigenvector for ad t in b', call it x. So there 
is a t' e t such that [t',x] = c'x, d ^ 0. Setting 



we have [t, x] — x. Let <f> t C $ consist of those roots for which (3{t) is a positive 
rational number. Then 

s := h © g/3 

/3e* t 

is a solvable subalgebra and so lies in a BSA, call it b". Since t C b" ,x £ h" 
we see that b" n b' has strictly larger dimension than b (~l b'. Also b" n b has 
strictly larger dimension than bflb' since hcbfl b". So we can conjugate b' 
to b" and then b" to b. 

This leaves only the case b n b' = which we will show is impossible. Let 
t be a maximal toral subalgebra of b'. We can not have t = 0, for then b' 
would consist entirely of nilpotent elements, hence nilpotcnt by Engel, and also 
self-normalizing as is every BSA. Hence it would be a CSA which is impossible 
since every CSA in a semi-simple Lie algebra is toral. So choose a CSA, h" 
containing t, and then a standard BSA containing h". By the preceding, we 
know that b' is conjugate to b" and, in particular has the same dimension as b". 
But the dimension of each standard BSA (relative to any Cartan subalgebra) 
is strictly greater than half the dimension of g, contradicting the hypothesis 
gDbffib'. QED 
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Chapter 6 



The simple finite 
dimensional algebras. 

In this chapter we classify all possible root systems of simple Lie algebras. A 
consequence, as we shall see, is the classification of the simple Lie algebras 
themselves. The amazing result - due to Killing with some repair work by Elic 
Cartan - is that with only five exceptions, the root systems of the classical 
algebras that we studied in Chapter III exhaust all possibilities. 

The logical structure of this chapter is as follows: We first show that the 
root system of a simple Lie algebra is irreducible (definition below). We then 
develop some properties of the of the root structure of an irreducible root system, 
in particular we will introduce its extended Cartan matrix. We then use the 
Perron-Frobenius theorem to classify all possible such matrices. (For the expert, 
this means that we first classify the Dynkin diagrams of the affine algebras of 
the simple Lie algebras. Surprisingly, this is simpler and more efficient than 
the classification of the diagrams of the finite dimensional simple Lie algebras 
themselves.) From the extended diagrams it is an easy matter to get all possible 
bases of irreducible root systems. We then develop a few more facts about root 
systems which allow us to conclude that an isomorphism of irreducible root 
systems implies an isomorphism of the corresponding Lie algebras. We postpone 
the the proof of the existence of the exceptional Lie algebras until Chapter VIII, 
where we prove Serre's theorem which gives a unified presentation of all the 
simple Lie algebras in terms of generators and relations derived directly from 
the Cartan integers of the simple root system. 

Throughout this chapter we will be dealing with semi-simple Lie algebras 
over the complex numbers. 
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6.1 Simple Lie algebras and irreducible root sys- 
tems. 

We choose a Cartan subalgebra h of a semi-simple Lie algebra g, so we have the 
corresponding set <& of roots and the real (Euclidean) space E that they span. 
We say that $ is irreducible if <f> can not be partitioned into two disjoint 
subsets 

such that every element of $1 is orthogonal to every element of $2- 
Proposition 17 7/g is simple then $ is irreducible. 

Proof. Suppose that $ is not irreducible, so we have a decomposition as above. 
If a e $1 and (3 e $2 then 

(a + 0, a) = (a, a) > and (a + /3, /3) = (/?,/?)> 

which means that a + (3 can not belong to either <!>! or <j> 2 and so is not a root. 
This means that 

[gcng/?] = 0. 

In other words, the subalgebra gi of g generated by all the g Q , a £ $1 is 
centralized by all the g^, so gi is a proper subalgebra of g, since if gi = g this 
would say that g has a non-zero center, which is not true for any semi-simple 
Lie algebra. The above equation also implies that the normalizer of gi contains 
all the g 7 where 7 ranges over all the roots. But these g 7 generate g. So gi is 
a proper ideal in g, contradicting the assumption that g is simple. QED 

Let us choose a base A for the root system $ of a semi-simple Lie algebra. 
We say that A is irreducible if we can not partition A into two non-empty 
mutually orthogonal sets as in the definition of irreducibility of <& as above. 

Proposition 18 <& is irreducible if and only if A is irreducible. 

Proof. Suppose that $ is not irreducible, so has a decomposition as above. 
This induces a partition of A which is non-trivial unless A is wholly contained 
in <!>! or <f> 2 . If A C $1 say, then since E is spanned by A, this means that all 
the elements of <& 2 are orthogonal to E which is impossible. So if A is irreducible 
so is <&. Conversely, suppose that 

A = Ai U A 2 

is a partition of A into two non-empty mutually orthogonal subsets. We have 
proved that every root is conjugate to a simple root by an element of the Weyl 
group W which is generated by the simple reflections. Let $1 consist of those 
roots which are conjugate to an element of Ai and $2 consist of those roots 
which are conjugate to an element of A2. The reflections sp,(3 £ A2 commute 
with the reflections s a , a £ Ai, and furthermore 



sp(a) = a 



6.2. THE MAXIMAL ROOT AND THE MINIMAL ROOT. 
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since (a, (3) = 0. So any element of 5>i is conjugate to an element of Ai by 
an element of the subgroup W\ generated by the s a , a £ A\. But each such 
reflection adds or subtracts a. So <I>i is in the subspace E\ of E spanned by Ai 
and so is orthogonal to all the elements of $2. So if $1 is irreducible so is A. 
QED 

We are now into the business of classifying irreducible bases. 

6.2 The maximal root and the minimal root. 

Suppose that <& is an irreducible root system and A a base, so irreducible. 
Recall that once we have chosen A, every root (3 is an integer combination of 
the elements of A with all coefficients non-negative, or with all coefficients non- 
positive. We write (3 >- in the first case, and [3 -< in the second case. This 
defines a partial order on the elements of E by 

(i -< A if and only if A — fi = ^ k a a, (6.1) 

where the k a are non- negative integers. This partial order will prove very im- 
portant to us in representation theory. 

Also, for any (3 = Y1 *= ^ + we define its height by 

ht/3 = ^fc Q . (6.2) 

a 

Proposition 19 Suppose that $ is an irreducible root system and A a base. 
Then 

• There exists a unique (3 G which is maximal relative to the ordering 
-<. 

• This (3 = ^k a a where all the k a are positive. 

• {(3, ct) > for all a e A and (/?, a) > for at least one a G A. 

Proof. Choose a (3 — ^k a a which is maximal relative to the ordering. 
At least one of the k a > 0. We claim that all the k a > 0. Indeed, suppose 
not. This partitions A into Ai, the set of a for which k a > and A 2 , the 
set of a for which k a — 0. Now the scalar product of any two distinct simple 
roots is < 0. (Recall that this followed from the fact that if (0:1,0:2) > 0, 
then S2(oi) = Oi — (01,02)02 would be a root whose «i coefficient is positive 
and whose 02 coefficient is negative which is impossible.) In particular, all the 
(01,02) < 0, 01 e Ai, o 2 e A 2 and so 

(f3,a 2 ) < 0, V o 2 £ A 2 . 

The irrcducibility of A implies that (01,02) 7^ for at least one pair oi £ 
Ai, o 2 £ A 2 . But this scalar product must then be negative. So 



(A o 2 ) <0 
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and hence 



is a root with 



2 P = (}- (p,a 2 )a 2 



s a2 P-pyO 



contradicting the maximality of p. So we have proved that all the k a are positive. 
Furthermore, this same argument shows that (/3, a) > for all a <E A. Since the 
elements of A form a basis of E, at least one of the scalar products must not 
vanish, and so be positive. We have established the second and third items in 
the proposition for any maximal p. We will now show that this maximal weight 
is unique. 

Suppose there were two, p and /?'. Write P' = ^ k' a a where all the k' a > 0. 
Then (P,P') > since (P,a) > for all a and > for at least one. Since spP' 
is a root, this would imply that P — P' is a root, unless /? = /?'. But if P — p' is 
a root, it is either positive or negative, contradicting the maximality of one or 
the other. QED 

Let us label the elements of A as a\, . . . , at, and let us set 



a :- 



-P 



so that «o is the minimal root. From the second and third items in the propo- 
sition we know that 

cto + + • • • + ktOLi = (6-3) 

and that 

{a ,ai) < 

for all i and < for some i. 

Let us take the left hand side (call it 7) of (6.3) and successively compute 
(7, i = 0, 1, . . . ,£. We obtain 



/ 2 (ai,a ) 
(a ,ai) 2 



(at-i,ae) 



(at, ai) 



/ 



ki 



\ktj 



0. 



This means that if we write the matrix on the left of this equation as 21 — A, 
then A is a matrix with on the diagonal and whose i,j entry is —((Xj, a-i). 
So A a non-negative matrix with integer entries with the properties 

• if Aij ^ then A di ^ 0, 

• The diagonal entries of A are 0, 

• A is irreducible in the sense that we can not partition the indices into two 
non-empty subsets / and J such that A^ = V i £ J, j € J and 



• A has an eigenvector of eigenvalue 2 with all its entries positive. 



6.3. GRAPHS. 
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We will show that the Perron-Frobcnius theorem allows us to classify all 
such matrices. From here it is an easy matter to classify all irreducible root 
systems and then all simple Lie algebras. For this it is convenient to introduce 
the language of graph theory. 

6.3 Graphs. 

An undirected graph T = (N, E) consists of a set N (for us finite) and a 
subset E of the set of subsets of N of cardinality two. We call elements of N 
"nodes" or "vertices" and the elements of E "edges". If e = {i,j} € E we 
say that the "edge" e joins the vertices i and j or that u i and j are adjacent". 
Notice that in this definition our edges are "undirected": {i, j} — {j, i}, and we 
do not allow self-loops. An example of a graph is the "cycle" A^p with £ + 1 
vertices, so N = {0, 1, 2, . . . , £} with adjacent to £ and to 1, with 1 adjacent 
to and to 2 etc. 

The adjacency matrix A of a graph T is the (symmetric) — 1 matrix 
whose rows and columns are indexed by the elements of N and whose i,j-th 
entry A^ = 1 if i is adjacent to j and zero otherwise. 

For example, the adjacency matrix of the graph A^ is 



fo 


i 





1\ 


1 





1 








1 





1 


V 





1 


<V 



We can think of A as follows: Let V be the vector space with basis given by 
the nodes, so we can think of the z-th coordinate of a vector x e V as assigning 
the value Xi to the node i. Then y = Ax assigns to i the sum of the values Xj 
summed over all nodes j adjacent to i. 

A path of length r is a sequence of nodes x^ , Xi 2 , . . . , x ir where each node 
is adjacent to the next. So, for example, the number of paths of length 2 joining 
i to j is the i, j-th entry in A 2 and similarly, the number of paths of length r 
joining i to j is the i,j-th entry in A r . The graph is said to be connected if 
there is a path (of some length) joining every pair of vertices. In terms of the 
adjacency matrix, this means that for every i and j there is some r such that 
the i,j entry of A r is non-zero. In terms of the theory of non-negative matrices 
(see below) this says that the matrix A is irreducible. 

Notice that if 1 denotes the column vector all of whose entries are 1, then 1 
is an eigenvector of the adjacency matrix of A^\ with eigenvalue 2, and all the 
entries of 1 are positive. In view of the Perron-Frobenius theorem to be stated 
below, this implies that 2 is the maximum eigenvalue of this matrix. 

We modify the notion of the adjacency matrix as follows: We start with a 
connected graph T as before, but modify its adjacency matrix by replacing some 
of the ones that occur by positive integers . If, in this replacement > 1 , 
we redraw the graph so that there is an arrow with lines pointing towards 
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the node i. For example, the graph labeled A^ in Table Aff 1 corresponds to 
the matrix 

'0 2 



2 

which clearly has ^j^j as an positive eigenvector with eigenvalue 2. 

(2) 

Similarly, diagram A\ in Table Aff 2 corresponds to the matrix 

4 
x 1 

which has {^j as eigenvector with eigenvalue 2. In the diagrams, the coefficient 

next to a node gives the coordinates of the eigenvector with eigenvalue 2, and it 
is immediate to check from the diagram that this is indeed an eigenvector with 
eigenvalue 2. For example, the 2 next to a node with an arrow pointing toward 
it in Cf ] satisfies 2-2 = 2-1 + 2 etc. 

It will follow from the Perron Frobenius theorem to be stated and proved 
below, that these are the only possible connected diagrams with maximal eigen- 
vector two. 

All the graphs so far have zeros along the diagonal. If we relax this condi- 
tion, and allow for any non-negative integer on the diagonal, then the only new 
possibilities are those given in Figure 4. 

Let us call a matrix symmetrizable if Afj ^ => Aji ^ 0. The main result 
of this chapter will be to show that the lists in the Figures 1-4 exhaust all irre- 
ducible matrices with non-negative integer matrices, which are symmetrizable 
and have maximum eigenvalue 2. 



6.4 Perron-Frobenius. 

We say that a real matrix T is non-negative (or positive) if all the entries 
of T are non-negative (or positive). We write T > or T > 0. We will use 
these definitions primarily for square (n x n) matrices and for column vectors 
= (n x 1) matrices. We let 

Q := {x e R" : x > 0, x + 0} 

so Q is the non-negative "orthant" excluding the origin. Also let 

C := {x > : ||x|| = 1}. 

So C is the intersection of the orthant with the unit sphere. 

A non-negative matrix square T is called primitive if there is a k such that 
all the entries of T k are positive. It is called irreducible if for any i,j there 
is a k — k(i,j) such that (T k )ij > 0. For example, as mentioned above, the 
adjacency matrix of a connected graph is irreducible. 
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Figure 6.1: Aff 1. 
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Figure 6.2: Aff 2 
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Figure 6.3: Aff 3 
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Figure 6.4: Loops allowed 
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If T is irreducible then / + T is primitive. 

In this section we will assume that T is non-negative and irreducible. 
Theorem 13 Perron-Frobenius. 

1. T has a positive (real) eigenvalue A max such that all other eigenvalues of 
T satisfy 

2. Furthermore A max has algebraic and geometric multiplicity one, and has 
an eigenvector x with x > 0. 

3. Any non-negative eigenvector is a multiple of x. 

4- More generally, if y > 0, y ^ is a vector and [i is a number such that 

Ty < \iy 

then 

y > 0, and fi > A max 
with \i — A max if and only if y is a multiple of x. 

5- If < S < T, S 7^ T then every eigenvalue a of S satisfies \a\ < A max . 

6. In particular, all the diagonal minors T^ obtained from T by deleting 
the i-th row and column have eigenvalues all of which have absolute value 
*^ A max . 

We will present a proof of this theorem after first showing how it classifies 
the possible connected diagrams with maximal eigenvalue two. But first let us 
clarify the meaning of the last two assertions of the theorem. The matrix Tu\ is 
usually thought of as an (n — 1) x (n — 1) matrix obtained by "striking out" the 
i-th row and column. But we can also consider the matrix Tj obtained from T 
by replacing the i-th row and column by all zeros. If x is an n- vector which is 
an eigenvector of Tj, then the n— 1 vector y obtained from x by omitting the (0) 
i-th entry of x is then an eigenvector of T^ with the same eigenvalue (unless 
the vector x only had non-zero entries in the i-th position). Conversely if y is 
an eigenvector of T^ then inserting at the i-th position will give an n-vector 
which is an eigenvector of Ti with with the same eigenvalue as that of y. 

More generally, suppose that S is obtained from T by replacing a certain 
number of rows and the corresponding columns by all zeros. Then we may apply 
item 5) of the theorem to this n x n matrix, S, or the "compressed version" of 
S obtained by eliminating all these rows and columns. 

We will want to apply this to the following special case. A subgraph V 
of a graph T is the graph obtained by eliminating some nodes, and all edges 
emanating from these nodes. Thus, if A is the adjacency matrix of T and A' 
is the adjacency matrix of A, then A' is obtained from A by striking out some 
rows and their corresponding columns. Thus if T is irreducible, so that we may 
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apply the Perron Frobenius theorem to A, and if T' is a proper subgraph (so 
we have actually deleted some rows and columns of A to obtain A'), then the 
maximum eigenvalue of A' is strictly less than the maximum eigenvalue of A' 
is strictly less than the maximum eigenvalue of A. Similarly, if an entry Aij is 
> 1, the matrix A' obtained from A by decreasing this entry while still keeping 
it positive will have a strictly smaller maximal eigenvalue. 

We now apply this theorem to conclude that the diagrams listed in Figures 
Aff 1, Aff2, and Aff 3 are all possible connected diagrams with maximal eigen- 
value two. A direct check shows that the vector whose coordinate at each node is 
the integer attached to that node given in the figure is an eigenvector with eigen- 
value 2. Perron-Frobcnius then guarantees 2 is the maximal eigenvalue. But 
now that we have shown that for each of these diagrams the maximal eigenvalue 
is two, any "larger" diagram must have maximal eigenvalue strictly greater than 
two and any "smaller" diagram must have maximal eigenvalue strictly less than 
two. 

To get started, this argument shows that A^ is the only diagram for which 
there is an i, j for which both and dji are > 1. Indeed, if A were such a 
matrix, by striking out all but the i and j rows and columns, we would obtain a 
two by two matrix whose off diagonal entries arc both > 2. If there were strict 
inequality, the maximum eigenvalue of this matrix would have to be bigger than 
2 (and hence also the original diagram) by Perron Frobenius. 

So other than A^\ we may assume that if > 1 then a rji = 1. 

(2) 

Since any diagram with some entry a^j > 4 must contain A 2 we see that 
this is the only diagram with this property and with maximum eigenvalue 2. 
So other than this case, all < 3. 

Diagram shows that a diagram with only two vertices and a triple bond 
has maximum eigenvalue strictly less than 2, since it is contained in G^ as 
a subdiagram. So any diagram with a triple bond must have at least three 
vertices. But then it must "contain" either G^ or But as both of these 

have maximal eigenvalue 2, it can not strictly contain either. So G^ and 

(3) 

D\ .are the only possibilities with a triple bond. 

Since A^\ £ > 2 is a cycle with maximum eigenvalue 2, no graph can contain 
a cycle without actually being a cycle, i.e. being Af . On the other hand, a 
simple chain with only single bonds is contained in A^\ and so must have 
maximum eigenvalue strictly less than 2, So other than Af\ every candidate 
must contain at least one branch point or one double bond. 

If the graph contains two double bonds, there are three possibilities as to 
the mutual orientation of the arrows, they could point toward one another as 
in away from one another as in or in the same direction as in A^ . 

But then these are the only possibilities for diagrams with two double bonds, 
as no diagram can strictly contain any of them. 

Also, striking off one end vertex of yields a graph with one extreme 
vertex with a double bound, with the arrow pointing away from the vertex, and 
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no branch points. Striking out one of the two vertices at the end opposite the 
double bond in h^p yields a graph with one extreme vertex with with a double 
bound and with the arrow pointing toward this vertex. So either diagram must 
have maximum eigenvalue < 2. 

Thus if there are no branch points, there must be at least one double bond 
and at least two vertices on either side of the double bond. The graph with 
exactly two vertices on either side is strictly contained in F^p and so is excluded. 
So there must be at least three vertices on one side and two on the other of the 
double bond. But then F^ and E^p exhaust the possibilities for one double 
bond and no branch points. 

If there is a double bond and a branch point then either the double bond 
points toward the branch, as in A 2 2 g_ 1 or away from the branch as in Bf\ But 
then these exhaust the possibilities for a diagram containing both a double bond 
and a branch point. 

If there are two branch points, the diagram must contain and hence 
must coincide with D^p . 

So we arc left with the task of analyzing the possibilities for diagrams with 
no double bonds and a single branch point. Let m denote the minimum number 
of vertices on some leg of a branch (excluding the branch point itself). If m > 2, 
then the diagram contains and hence must coincide with E^ . So we may 
assume that m = 1. If two branches have only one vertex emanating, then the 
diagram is strictly contained in and hence excluded. So each of the two 
other legs have at least two or more vertices. If both legs have more than two 
vertices on them, the graph must contain, and hence coincide with E^ . We 
are left with the sole possibility that one of the legs emanating from the branch 
point has one vertex and a second leg has two vertices. But then either the 
graph contains or is contained in E^ so is the only such possibility. 

We have completed the proof that the diagrams listed in Aff 1, Aff 2 and 
Aff 3 are the only diagrams without loops with maximum eigenvalue 2. 

If we allow loops, an easy extension of the above argument shows that the 
only new diagrams are the ones in the table "Loops allowed" . 



6.5 Classification of the irreducible A. 

Notice that if we remove a vertex labeled 1 (and the bonds emanating from it) 
from any of the diagrams in Aff 2 or Aff 3 we obtain a diagram which can also 
be obtained by removing a vertex labeled 1 from one of the diagrams in Aff 1. 
(In the diagram so obtained we ignore the remaining labels.) Indeed, removing 
the right hand vertex labeled 1 from D^p yields A2 which is obtained from A^ 
by removing a vertex. Removing the left vertex marked 1 gives G2, the diagram 
obtained from G^p by removing the vertex marked 1. 

Removing a vertex from A 2 gives A\. Removing the vertex labeled 1 from 
A 2 2 g yields B 2 i, obtained by removing one of the vertices labeled 1 from . 
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Removing a vertex labeled 1 from A 2 ^_ 1 yields D^t or Cii, removing a vertex 

labeled 1 from D < ^ 1 yields B l+ i and removing a vertex labeled 1 from E^ 
yields F 4 or C 4 . 

Thus all irreducible A correspond to graphs obtained by removing a vertex 
labeled 1 from the table Aff 1. So we have classified all possible Dynkin diagrams 
of all irreducible A . They are given in the table labeled Dynkin diagrams. 



6.6 Classification of the irreducible root systems. 

It is useful to introduce here some notation due to Bourbaki: A subset $ of a 
Euclidean space E is called a root system if the following axioms hold: 

• $ is finite, spans E and does not contain 0. 

• If a G $ then the only multiples of a which arc in $ are ±a. 

• If a G <f> then the reflection s a in the hypcrplane orthogonal to a sends $ 
into itself. 

• If a,0 e $ then (0,a) e Z, 
Recall that 

((3, a) 

so that the reflection s a is given by 

s a (0) = 0-(0,a)a. 

We have shown that each semi-simple Lie algebra gives rise to a root system, and 
derived properties of the root system. If we go back to the various arguments, 
we will find that most of them apply to a "general" root system according to the 
above definition. The one place where we used Lie algebra arguments directly, 
was in showing that if ^ ±a is a root then the collection of j such that (3 + ja 
is a root forms an unbroken chain going from — r to q where r — q = ((3, a). 
For this we used the representation theory of sl(2). So we now pause to give 
an alternative proof of this fact based solely on that axioms above, and in the 
process derive some additional useful information about roots. 

For any two non-zero vectors a and (3 in E, the cosine of the angle between 
them is given by 

|H|||/3||cos0=(a,/3). 

So 

(Aa>=2^cose. 

IMI 

Interchanging the role of a and [3 and multiplying gives 



2(0, a) 
(a, a) 



(0,a)(a,0) = 4 cos 2 9. 
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Figure 6.5: Dynkin diagrams. 
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The right hand side is a non-negative integer between and 4. So assuming 
that a ±/3 and \\f3\\ > \\a\\ The possibilities arc listed in the following table: 



\a,(3) 




< < 7T 


II/3II7NI 2 








tt/2 


undetermined 


1 


1 


tt/3 


1 


-1 


-1 


2tt/3 


1 


1 


2 


tt/4 


2 


-1 


-1 


3tt/4 


2 


1 


3 


tt/6 


3 


-1 


-3 


5tt/6 


3 



Proposition 20 If a ^ ±/3 and if (a, (3) > i/ien a — [3 is a root. If (a, (3) < 
i/ien a + /3 is a root 

Proof. The second assertion follows from the first by replacing (3 by —(3. So 
we need to prove the first assertion. From the table, one or the other of (f3, a) 
or (a, (3) equals one. So either s a f3 — (3 — a is a root or spa — a — (3 is a root. 
But roots occur along with their negatives so in either event a — (3 is a root. 
QED 

Proposition 21 Suppose that a ^ ±/3 are roots. Let r be the largest integer 
such that (3 — ra is a root, and let q be the largest integer such that (3 + qa is a 
root. Then (3 + ia is a root for all —r<i<q. Furthermore r — q = (f3,a) so 
in particular \q — r\ < 3. 

Proof. Suppose not. Then we can find a p and an s such that — r < p < s < q 
such that (3 + pa is a root, but (3 + (p + l)a is not a root, and (3 + sa is a root 
but (3 + (s — l)a is not. The preceding proposition then implies that 

([3 + pa, a)>0 while (/3 + sa, a) < 

which is impossible since (a, a) = 2 > 0. 

Now s a adds a multiple of a to any root, and so preserves the string of roots 
(3 — ra, (3 — (r — l)a, . . . , [3 + qa. Furthermore 

s a {(3 + ia) — (3 — ((/?, a) + i)a 

so s a reverses the order of the string. In particular 

s a ((3 + qa) = (3- ra. 

The left hand side is (3— {{[3, a)+q)a so r— q = (/?, a) as stated in the proposition. 
QED 

We can now apply all the preceding definitions and arguments to conclude 
that the Dynkin diagrams above classify all the irreducible bases A of root 
systems. 
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Since every root is conjugate to a simple root, we can use the Dynkin dia- 
grams to conclude that in an irreducible root system, either all roots have the 
same length (cases A, D, E) or there are two root lengths - the remaining cases. 
Furthermore, if (3 denotes a long root and a a short root, the ratios ||/3|| 2 /||a|| 2 
are 2 in the cases B, C, and and 3 for the case G*2. 

Proposition 22 In an irreducible root system, the Weyl group W acts irre- 
ducibly on E. In particular, the W -orbit of any root spans E. 

Proof. Let E' be a proper invariant subspace. Let E" denote its orthogonal 
complement, so 

E = E' E". 

For any root a, If e € E' then s a e = e — (e, a) a EE'. So either (e, a) = for 
all e, and so a e E" or a e E'. Since the roots span, they can't all belong to 
the same subspace. This contradicts the irreducibility. QED 

Proposition 23 If there are two distinct root lengths in an irreducible root 
system, then all roots of the same length are conjugate under the Weyl group. 
Also, the maximal weight is long. 

Proof. Suppose that a and j3 have the same length. We can find a Weyl group 
element W such that w(3 is not orthogonal to a by the preceding proposition. 
So we may assume that {(3, a) ^ 0. Since a and (3 have the same length, by the 
table above we have ((3, a) — ±1. Replacing (3 by —f3 — sp[3 we may assume 
that {13, a) = 1. Then 

{sps a sp)(a) = {sps a ){a - (3) 
= sp(-a — (3 + a) 
= sp{-(3) 
= (3. QED 

Let (E,&) and (E',&') be two root systems. We say that a linear map 
/ : E — > E' is an isomorphism from the root system (E, <&) to the root system 
(E' , $') if / is a linear isomorphism of E onto E' with /(<!>) = and 

(f(J3),f(a)) = <J3,a) 

for all a,[3e$. 

Theorem 14 Let A = {a\, . . . ,ae} be a base 0/$. Suppose that (£',$') is a 
second root system with base A' = {a[, . . . , a' e } and that 

{a^otj} = {a'^a'j}, Vl<i,j<£. 

Then the bijection 

ai 1 ► a\ 
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extends to a unique isomorphism f : [E, <f>) — > (_E',<I>'). In other words, the 
Cartan matrix A of A determines <& up to isomorphism. In particular, The 
Dynkin diagrams characterize all possible irreducible root systems. 

Proof. Since A is a basis of E and A' is a basis of E' , the map a, i— > a- 
extends to a unique linear isomorphism of E onto £". The equality in the 
theorem implies that for a, f3 £ A we have 

*/(«)/(/?) = /(/?) - (/(/?), /(a))/(a) = f{s a [3). 

Since the Weyl groups are generated by these simple reflections, this implies 
that the map 

w i— » / o w o / _1 

is an isomorphism of W onto Vy' . Every /3 <G <I> is of the form w(a) where w £ W 
and a is a simple root. Thus 

f(P) = fowof-\f(a) e$' 

so /(<&) = Since s a (/3) = /?— a)a, the number a) is determined by the 
reflection s a acting on (3. But then the corresponding formula for <&' together 
with the fact that 

Sf(a) = fosaof- 1 

implies that 

(/(/?),/(«)} = (/3, a). 

QED 

6.7 The classification of the possible simple Lie 
algebras. 

Suppose that g, h, is a pair consisting of a semi-simple Lie algebra g, and a 
Cartan subalgebra h. This determines the corresponding Euclidean space E 
and root system Suppose we have a second such pair (g', h'). We would like 
to show that an isomorphism of (E,Q) with (E',&) determines a Lie algebra 
isomorphism of g with g'. This would then imply that the Dynkin diagrams 
classify all possible simple Lie algebras. We would still be left with the problem 
of showing that the exceptional Lie algebras exist. We will defer this until 
Chapter VIII where we prove Serre's theorem with gives a direct construction 
of all the simple Lie algebras in terms of generators and relations determined 
by the Cartan matrix. 

We need a few preliminaries. 

Proposition 24 Every positive root can be written as a sum of simple roots 
in such a way that every partial sum is again a root. 
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Proof. By induction (on say the height) it is enough to prove that for every 
positive root (3 which is not simple, there is a simple root a such that (3 — a is 
a root. We can not have (/?, a) < for all a e A for this would imply that the 
set {/?} U A is independent (by the same method that we used to prove that A 
was independent). So (/?, a) > for some a £ A and so (3 — a is a root. Since 
(3 is not simple, its height is at least two, and so subtracting a will not be zero 
or a negative root, hence positive. QED 

Proposition 25 Let g, h be a semi-simple Lie algebra with a choice of Cartan 
subalgebra, Let $ be the corresponding root system, and let A be a base. Then 
g is generated as a Lie algebra by the subspaces g Q ,g- Q ,a e A. 

From the representation theory of sl(2) a we know that [g Q , gp] = ga+p if a + (3 
is a root. Thus from the preceding proposition, we can successively obtain all 
the g/3 for (3 positive by bracketing the g a , a E A. Similarly we can get all 
the gp for (3 negative from the g_ Q . So we can get all the root spaces. But 
[g Q ,g_ Q ] = Ch a so we can get all of h. The decomposition 

g = h®0g 7 

76* 

then shows that we have generated all of g. 
Here is the big theorem: 

Theorem 15 Let g, h and g', h' be simple Lie algebras with choices of Cartan 
subalgebras, and let <&,<!>' be the corresponding root systems. Suppose there is 
an isomorphism 

/:(£?, $)->(£',$') 
which is an isometry of Euclidean spaces. Extend f to an isomorphism of 

h* -> h'* 

via complexification. Let f : h — > h' denote the corresponding isomorphism on 
the Cartan subalgebras obtained by identifying h and h' with their duals using 
the Killing form. 

Fix a base A of $ and A' of <&'. Choose 0^i„G g a , a € A and ^ x' a , e 
g' a , . Extend f to a linear map 

f :h® 0g Q ^h'® g a , 

aeA a'eA' 

by 

f( x a) = X a i . 

Then f extends to a unique isomorphism of g — > g' . 

Proof. The uniqueness is easy. Given x a there is a unique y a e g_ Q for which 
] = h a so /, if it exists, is determined on the y a and hence on all of g 
since the x a and y a generate g be the preceding proposition. 
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To prove the existence, we will construct the graph of this isomorphism. 
That is, we will construct a subalgebra k of g © g' whose projections onto the 
first and onto the second factor are isomorphisms: 

Use the x a and y a as above, with the corresponding elements x' a , and y' a , in 
g'. Let 

x a :— x a ffi x a , G g ffi g 

and similarly define 

y a ■= y a ®y' a >, 

and 

h a '.— h a ffi h a , . 

Let be the (unique) maximal root of g, and choose x G g/3. Make a similar 
choice of x' G g'p, where 0' is the maximal root of g'. Set 

x := x © x'. 

Let m C g ffi g' be the subspace spanned by all the 

ady aii ---ady^x. 
The clement ad y a .^ • • • ad y a . x belongs to gp-j^ Qj ffi g^<_^r a > so 

m H (g/3 ffi gp, ) is one dimensional . 

In particular m is a proper subspace of g ffi g'. 

Let k denote the subalgebra of g ffi g' generated by the x a the y a and the 
h a . We claim that 

[k, m] C m. 

Indeed, it is enough to prove that m is invariant under the adjoint action of the 
generators of k. For the ady Q this follows from the definition. For the a,dh a we 
use the fact that 

[h,y a ] = -a(h)y a 

to move the a,dh a past all the ady 7 at the cost of introducing some scalar 
multiple, while 

ad h a x — (0, a)xp + (0' , a')x'p, — (0, a)x 

because / is an isomorphism of root systems. 

Finally [x ai ,y a2 ] = if a\ ^ a 2 6 A since a\ — a 2 is not a root. On the 
other hand [x Q ,y a ] = h a . So we can move the &dx a past the ady 7 at the 
expense of introducing an &dh a every time 7 = a. Now a + is not a root, 
since is the maximal root. So [x a ,X/j\ = 0. Thus &dx a x = 0, and we have 
proved that [k, m] C m. But since m is a proper subspace of gffig', this implies 
that k is a proper subalgebra, since otherwise m would be a proper ideal, and 
the only proper ideals in g ffi g' are g and g'. 
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Now the subalgebra k can not contain any element of the form z © 0, z ^ 0, 
for it if did, it would have to contain all of the elements of the form u © since 
we could repeatedly apply adx a 's until we reached the maximal root space and 
then get all of g © 0, which would mean that k would also contain all of © g' 
and hence all of g © g' which we know not to be the case. Similarly k can not 
contain any element of the form 0(Bz'. So the projections of k onto g and onto g' 
are linear isomorphisms. By construction they are Lie algebra homomorphisms. 
Hence the inverse of the projection of k onto g followed by the projection of k 
onto g' is a Lie algebra isomorphism of g onto g'. By construction it sends x a 
to x' a , and h a to h a i and so is an extension of /. QED 



Chapter 7 

Cyclic highest weight 
modules. 

In this chapter, g will denote a semi-simple Lie algebra for which we have chosen 
a Cartan subalgebra, h and a base A for the roots $ = <f> + U $~ of g. 

We will be interested in describing its finite dimensional irreducible repre- 
sentations. If W is a finite dimensional module for g, then h has at least one 
simultaneous eigenvector; that is there is a [i e h* and a, w ^ e W such that 

hw = n{h)w V/iGh. (7.1) 

The linear function fj, is called a weight and the vector v is called a weight 
vector. If x £ g a , 

hxw = [h, x]w + xhw = (yu + a){h)xw. 

This shows that the space of all vectors w satisfying an equation of the type 
(7.1) (for varying /i) spans an invariant subspace. If W is irreducible, then the 
weight vectors (those satisfying an equation of the type (7.1)) must span all of 
W. Furthermore, since W is finite dimensional, there must be a vector v and a 
linear function A such that 

hv = X(h)v V/ieh, e a v = 0, Va e $ + . (7.2) 

Using irreducibility again, we conclude that 

W = U(g)v. 

The module is cyclic generated by v. In fact we can be more precise: Let 
hi,... ,he be the basis of h corresponding to the choice of simple roots, let 
ej € g ai , fi € g-a 4 where a.\, . . . , a m are all the positive roots. (We can choose 
them so that each e and / generate a little sl{2).) Then 

g = n_®h®n+, 
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where e±, . . . , e m is a basis of n + , where hi, . . . , hf is a basis of h, and fi, . . . , f m 
is a basis of n_. The Poincare-Birkhoff-Witt theorem says that monomials of 
the form 

Jl Jm n l < l l e l e m 

form a basis of t/(g). Here we have chosen to place all the e's to the extreme 
right, with the h's in the middle and the /'s to the left. It now follows that the 
elements 

/i 1 ■■■/£•« 

span VT. Every such element, if non-zero, is a weight vector with weight 

A - («iai H h i m a m ). 

Recall that 

fj, -< A means that A — /i = ^ fciai, > 0, 

where the fci are non-negative integers. We have shown that every weight \i of 
W satisfies 

fi -< A. 

So we make the definition: A cyclic highest weight module for g is a module 
(not necessarily finite dimensional) which has a vector v + such that 

x+v+ = 0, V x+ G n + , hv + = \{h)v + \/h e h 

and 

v = u( s )v+. 

In any such cyclic highest weight module every submodule is a direct sum of its 
weight spaces (by van der Monde). The weight spaces V M all satisfy 

/j, -< A 

and we have 

Any proper submodule can not contain the highest weight vector, and so the 
sum of two proper submodules is again a proper submodule. Hence any such V 
has a unique maximal submodule and hence a unique irreducible quotient. The 
quotient of any highest weight module by an invariant submodule, if not zero, 
is again a cyclic highest weight module with the same highest weight. 

7.1 Verma modules. 

There is a "biggest" cyclic highest weight module, associated with any A e h* 
called the Verma module. It is defined as follows: Let us set 



b := h® n + . 
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Given any A € h* let C\ denote the one dimensional vector space C with 
basis z + and with the action of b given by 

(h+ ^x /3 )z + := X(h)z+. 

So it is a left U(b) module. By the Poincare Birkhoff Witt theorem, U (g) is a 
free right Uih) module with basis {fl 1 ■ ■ ■ f\ m }, and so we can form the Verma 
module 

Verm(A) := [7(g) ® u{h) C A 

which is a cyclic module with highest weight vector v + := 1 ® z+. 

Furthermore, any two irreducible cyclic highest weight modules with the 
same highest weight are isomorphic. Indeed, if V and W are two such with 
highest weight vector v+, it+, consider V W which has (v+, u+) as a maximal 
weight vector with weight A, and hence Z := U(g)(v+,u+) is cyclic and of 
highest weight A. Projections onto the first and second factors give non-zero 
homomorphisms which must be surjective. But Z has a unique irreducible 
quotient. Hence these must induce isomorphisms on this quotient, V and W 
are isomorphic. 

Hence, up to isomorphism, there is a unique irreducible cyclic highest weight 
module with highest weight A. We call it 

Irr(A). 

In short, we have constructed a "largest" highest weight module Verm(A) 
and a "smallest" highest weight module Irr(A). 

7.2 When is dimlrr(A) < oo? 

If Irr(A) is finite dimensional, then it is finite dimensional as a module over any 
subalgebra, in particular over any subalgebra isomorphic to sZ(2). Applied to 
the subalgebra sZ(2)j generated by a, hi, fi we conclude that 

X(hi) G Z. 

Such a weight is is called integral. Furthermore the representation theory of 
sl(2) says that the maximal weight for any finite dimensional representation 
must satisfy 

X(hi) - (X,Oi) > 

so that A lies in the closure of the fundamental Weyl chamber. Such a weight is 
called dominant. So a necessary condition for Irr(A) to be finite dimensional 
is that A be dominant integral. We now show that conversely, Irr(A) is finite 
dimensional whenever A is dominant integral. 

For this we recall that in the universal enveloping algebra U(g) we have 



1. [ ej ,/f +1 ]=0,if^j 
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2. [h j ,f* +1 ] = -(k + l) ai (h j )f* +1 

3. [e i ,f? +1 ] = -(k + l)ft(k-l-h i ) 

where the first two equations are consequences of the fact that ad is a derivation 
and 

fj] — if i 7^ j since a, — Qj is not a root 

and 

The last is a the fact about sZ(2) which we have proved in Chapter II. Notice 
that it follows from 1.) that ej{f^)v + = for all k and all i ^ j and from 3.) 
that 

ej^ )+ V = 

so that f^ h ^ +1 v + is a maximal weight vector. If it were non-zero, the cyclic 
module it generates would be a proper submodule of Irr(A) contradicting the 
irrcducibility. Hence 

= o. 

So for each i the subspace spanned by v + , fav + , • • • , f^ hi \ + is a finite dimen- 
sional sl(2)i module. In particular Irr(A) contains some finite dimensional s/(2)j 
modules. Let V denote the sum of all such. If W is a finite dimensional sZ(2)j 
module, then e a W is again finite dimensional, thus so their sum, which is a 
finite dimensional sl(2)i module. Hence V' is g- stable, hence all of Irr(A). 

In particular, the and the /, act as locally nilpotent operators on Irr(A). 
So the operators Tj := (expei)(exp— /j)(expej) are well defined and 

r i (Irr(A))^=Irr(A) SiP 

so 

dimIrr(A) WM = dimIrr(A) M ViueW (7.3) 

where W denotes the Weyl group. These are all finite dimensional subspaces: 
Indeed their dimension is at most the corresponding dimension in the Verma 
module Verm(A), since Irr(A) M is a quotient space of Verm(A)^. But Verm(A)^ 
has a basis consisting of those f l 1 • • • f%™v + . The number of such elements is 
the number of ways of writing 

X- li = k x OL X H k m a m . 

So dim Verm(A) Ai is the number of m-tuplets of non-negative integers (fci, . . . , k m ) 
such that the above equation holds. This number is clearly finite, and is known 
as Pff(A — /i), the Kostant partition function of A — /x, which will play a central 
role in what follows. 

Now every element of E is conjugate under W to an clement of the closure 
of the fundamental Weyl chamber, i.e. to a fi satisfying 



(//,£*») > 
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i.e. to a fi that is dominant. Wc claim that there are only finitely many dominant 
weights ji which arc < A, which will complete the proof of finite dimensionality 
Indeed, the sum of two dominant weights is dominant, so A + \i is dominant. 
On the other hand, A — \l = J2 kitti with the fc, > 0. So 

(A, A) - (/x, u) = (A + ijl, A - n) = ki(X + \i, a») > 0. 

So jU lies in the intersection of the ball of radius a/ (A, A) with the discrete set 
of weights -< A which is finite. 

We record a consequence of (7.3) which is useful under very special circum- 
stances. Suppose we are given a finite dimensional representation of g with the 
property that each weight space is one dimensional and all weights arc conju- 
gate under W. Then this representation must be irreducible. For example, take 
g = sl(n + 1) and consider the representation of g on A fc (C™ +1 ), 1 < k < n. In 
terms of the standard basis e l7 . . . , e n+1 of C n+1 the elements A • • • A e ik are 
weight vectors with weights L il + ■ ■ ■ + L ik , Where h consists of all diagonal 
traceless matrices and Lj is the linear function which assigns to each diagonal 
matrix its i-th entry. 

These weight spaces are all one dimensional and conjugate under the Weyl 
group. Hence these representations are irreducible with highest weight 

LOi := L\ + ■ ■ ■ + Lh 

in terms of the usual choice of base, hi,...,h n where hj is the diagonal matrix 
with 1 in the j-th position, —1 in the j + 1-st position and zeros elsewhere. 
Notice that 

u)i(hj) = Sij 

so that the form a basis of the "weight lattice" consisting of those A e h* 
which take integral values on hi, . . . , h n . 

7.3 The value of the Casimir. 

Recall that our basis of U(g) consists of the elements 

/*i . . . fimhi 1 ■ ■ ■ h^ e p kl ■ ■ ■ p km 
J I Jm n \ a l e l t m ■ 

The elements of U(h) are then the ones with no e or / component in their 
expression. So we have a vector space direct sum decomposition 

(7(g)- U(h) e([/(g)n + +n_C/(g)), 

where n + and n_ are the corresponding nilpotent subalgebras. Let 7 denote 
projection onto the first factor in this decomposition. Now suppose z € Z(g), 
the center of the universal enveloping algebra. In particular, z € U(g) h . The 
eigenvalues of the monomial above under the action of h G h are 

m 

^2(k s - i s )a s (h). 

8=1 
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So any monomial in the expression for z can not have / factors alone. We have 
proved that 

z--f(z) eC/(g)n+, VzGZ(g). (7.4) 

For any A G h*, the element z G Z(g) acts as a scalar, call it xx( z ) on the 
Verma module associated to A. 

In particular, if A is a dominant integral weight, it acts by this same scalar 
on the irreducible finite dimensional module associated to A. 

On the other hand, the linear map A : h — > C extends to a homomorphism, 
which we will also denote by A of U(h) = S(h) — > C. Explicitly, if we think of 
elements of [/(h) = 5(h) as polynomials on h*, then A(P) = P(\) for P G S(h). 
Since n + v = if v is the maximal weight vector, we conclude from (7.4) that 

X x(z) = \(-r(z)) VzGZ(g). (7.5) 

We want to apply this formula to the second order Casimir element associated 
to the Killing form k. So let ki, . . . , kg G h be the dual basis to hi,...,he 
relative to k, i.e. 

K.(hi ,kj) — 8ij . 

Let x a G g Q be a basis (i.e. non-zero) clement and z a G g_ Q be the dual basis 
element to x a under the Killing form, so the second order Casimir element is 

Cas -— ^ ^ h^k>i ~\~ ^ ^ XqZq. 

a 

where the second sum on the right is over all roots. We might choose the 
x a — e a for positive roots, and then the corresponding z a is some multiple of 
the f a . (And, for present purposes we might even choose f a = z a for positive 
a.) The problem is that the z a for positive a in the above expression for Cas K 
are written to the right, and we must move them to the left. So we write 

Cas = ^ ' hiki + ^ ' [ffq, z a ] + ^ ' z a x a + ^ ' x a z a . 

i a>0 Q>0 q<0 

This expression for Cas K has all the n + elements moved to the right; in partic- 
ular, all of the summands in the last two sums annihilate v\. Hence 

7(Cas K ) = ^2 hiki + ^ [x a , z a ) 

i ct>0 

and 

XA(Cas K ) - Hh t )X(h) + J2 K[x a , Za})- 

For any ft, G h we have 

n(h, [x a ,z a ]) = K([h,x a ],z a ) = a(h)K(x a ,z a ) = a(h) 

so 

Xa 7 Z a ] t a 
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where t a £ h is uniquely determined by 

n(t a ,h) = a(h) V h e h. 

Let ( , ) K denote the bilinear form on h* obtained from the identification of h 
with h* given by n. Then 

where 

i) : - : 



On the other hand, let the constants ai be defined by 



\(h) = V" a,iK(hi, h) V/ieh. 
In other words A corresponds to ^ a,/^ under the isomorphism of h with h* so 

(A, A )k = E a i a j K ( h i> h j)- 

Since n{hi,kj) = Sij we have 

A(fcj) = Oj. 

Combined with A(/ij) = ^ . ajK(hj,hi) this gives 

(A,A) K = ^A(^)A(fc,)- (7-7) 

Combined with (7.6) this yields 

X A (Cas K ) = (A + p, A + p) K - (p, p) K . (7.8) 

We now use this innocuous looking formula to prove the following: We let 
L = L g C denote the lattice of integral linear forms on h, i.e. 

L = { M eh*|2^4* eZV^eA}. (7.9) 

(?) 

L is called the weight lattice of g. 

For p,, A € L recall that 

fj, -< A 

if A — p is a sum of positive roots. Then 
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Proposition 26 Any cyclic highest weight module Z(X),X £ L has a composi- 
tion series whose quotients are irreducible modules, Irr(/i) where p, -< X satisfies 

(p + p,p + p) K = (\ + p,\ + p) K . (7.10) 

In fact, if 

d = dim Z(X) fl 

where the sum is over all p satisfying (7.10) then there are at most d steps in 
the composition series. 

Remark. There arc only finitely many p £ L satisfying (7.10) since the set 
of all p satisfying (7.10) is compact and L is discrete. Each weight is of finite 
multiplicity. Therefore d is finite. 

Proof by induction on d. We first show that if d = 1 then Z(X) is irreducible. 
Indeed, if not, any proper submodule W, being the sum of its weight spaces, 
must have a highest weight vector with highest weight p, say. But then 

X A(Cas K )= XAl (Cas K ) 

since W is a submodule of Z(X) and Cas K takes on the constant value XA(Cas K ) 
on Z(X). Thus p and A both satisfy (7.10) contradicting the assumption d=l. 
In general, suppose that Z(X) is not irreducible, so has a submodule, W and 
quotient module Z(X)/W. Each of these is a cyclic highest weight module, 
and we have a corresponding composition series on each factor. In particular, 
d = dw + d z r\)/w so that the d's are strictly smaller for the submodule and the 
quotient module. Hence we can apply induction. QED 

For each A £ L we introduce a formal symbol, e(A) which we want to think 
of as an "exponential" and so the symbols are multiplied according to the rule 

e{p)-e{v)=e{p + v). (7.11) 

The character of a module N is defined as 

chjv = ^ dim ■ e{p). 

In all cases we will consider (cyclic highest weight modules and the like) all 
these dimensions will be finite, so the coefficients are well defined, but (in the 
case of Verma modules for example) there may be infinitely many terms in the 
(formal) sum. Logically, such a formal sum is nothing other than a function on 
L giving the "coefficient" of each e(p). 

In the case that N is finite dimensional, the above sum is finite. If 

f = /Z -A^O) and 9 = ^9 v e{v) 

are two finite sums, then their product (using the rule (7.11)) corresponds to 
convolution: 

(E^m) • (E^h) = E(/*sM a ) 
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where 

{f*g)\ ■= -f» 9u - 

So we let Zfi n (L) denote the set of Z valued functions on L which vanish outside 
a finite set. It is a commutative ring under convolution, and we will oscillate 
in notation between writing an element of Zfi n (L) as an "exponential sum" 
thinking of it as a function of finite support. 

Since we also want to consider infinite sums such as the characters of Verma 
modules, we enlarge the space Zfi n (L) by defining Z gon (L) to consist of Z val- 
ued functions whose supports are contained in finite unions of sets of the form 
A — ^2 a y k a a. The convolution of two functions belonging to Z gcn (L) is well 
defined, and belongs to Z gon (L). So Z gon (L) is again a ring. 

It now follows from Prop. 26 that 

chz(A) = Cn Irr(M) 

where the sum is over the finitely many terms in the composition series. In 
particular, we can apply this to Z(X) = Vcrm(A), the Verma module. Let us 
order the \ii -< A satisfying (7.10) in such a way that /Xj -<, jij => i < j. Then 
for each of the finitely many /i, occurring we get a corresponding formula for 
chverm(^) and so we get collection of equations 

Chvcrm(^ 3 ) = ^ dijCh Irr(ji<) 

where an = 1 and i < j in the sum. We can invert this upper triangular matrix 
and therefore conclude that there is a formula of the form 

chl rr(A ) = Y 6 (M) ch Vorm( M ) (7-12) 

where the sum is over /x -< A satisfying (7.10) with coefficients &(/i) that we shall 
soon determine. But we do know that 6(A) = 1. 

7.4 The Weyl character formula. 

We will now prove 

Proposition 27 The non-zero coefficients in (7.12) occur only when 

fi = w(X + p) - p 
where w G W, the Weyl group of g, and then 

Km) = (-ir 



Here 



(-l) w := detio. 
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We will prove this by proving some combinatorial facts about multiplication of 
sums of exponentials. 

We recall our notation: For A e h*, Irr(A) denotes the unique irreducible 
module of highest weight, A, and Verm(A) denotes the Verma module of highest 
weight A, and more generally, Z{\) denotes an arbitrary cyclic module of highest 
weight A. Also 

is one half the sum of the positive roots. Let A,, i = 1, . . . , dim h be the basis of 
the weight lattice, L dual to the base A. So 

\{h aj ) = {K,otj) = Sij. 

Since Sj(a!j) = — on while keeping all the other positive roots positive, we saw 
that this implied that 

Sip = p - on 

and therefore 

(p,di) = l, i = !,...,£ := dim(h). 

In other words 

P= \ Y, 4>=Ai + --- + A^. (7.13) 

4>e®+ 

The Kostant partition function, Pk{p) is defined as the number of sets 
of non-negative integers, kp such that 

p = Y k pP- 

(The value is zero if /i can not be expressed as a sum of positive roots.) 

For any module N and any p E h* , denotes the weight space of weight p,. 
For example, in the Verma module, Verm (A), the only non-zero weight spaces 
are the ones where p = A — X)/3e$+ and the multiplicity of this weight 
space, i.e. the dimension of Verm(A) M is the number of ways of expressing in 
this fashion, i.e. 

dim Verm(A) (U = P K (X - p)- (7.14) 

In terms of the character notation introduced in the preceding section we 
can write this as 

chvorm(A) = ^ ^(A - /l)e(/l). 

To be consistent with Humphreys' notation, define the Kostant function p by 

p(u) - P K {-v) 

and then in succinct language 



chvorm(A) = P{' ~ A). 



(7.15) 
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Observe that if 

/ = £/(mMm) 

then _ 

/ • e(A) = J2 /(M)e(A + p) = £ f(v - A)e(i/). 

We can express this by saying that 

/•e(A) = /(•- A). 

Thus, for example, 

chvcrm(A) = p(- ~ A) = p ■ e(A). 

Also observe that if 

f a = := 1 + e{-a) + e(-2a) + • • • 

1 - e(— a) 

then 

(1 - e(-a))/ Q = 1 

and 

by the definition of the Kostant function. 
Define the function q by 

q : = n ( g («/ 2 ) - e (-«/2)) = e ^ ri( i - e (-«)) 

since e(p) = e(a/2). Notice that 

«K?=(-ir«. 

It is enough to check this on fundamental reflections, but they have the property 
that they make exactly one positive root negative, hence change the sign of q. 
We have 

qp = e(p). (7.16) 



Indeed, 



qpe(-p) = [JJ(1 - e(-a)) e{p)pe{-p) 
= [JJ(1- e (-a))]p 

= n( i - e (-«))n./'« 



= i. 



Therefore, 



gchvorm(A) = <7Pe(A) = e(p)e(A) = e(A + p). 
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Let us now multiply both sides of (7.12) by q and use the preceding equation. 
We obtain 

<?ch Irr(A) = %)e(M + P) 

where the sum is over all p < A satisfying (7.10), and the b(p) are coefficients 
we must determine. 

Now ch Irr ( A ) is invariant under the Weyl group W, and q transforms by 
(— l) w . Hence if we apply w £ W to the preceding equation we obtain 

(-l)*"gchi rr (A) = 6 (/ i ) e ( w (M + P))- 

This shows that the set of p + p with non-zero coefficients is stable under W 
and the coefficients transform by the sign representation for each W orbit. In 
particular, each element of the form p — w(X+p) —p has ( — l) w as its coefficient. 
We can thus write 

qch v(x) = J2 (-l) W e(w(X + p)) + R 

wew 

where R is a sum of terms corresponding to p + p which are not of the form 
w(X + p). We claim that there are no such terms and hence R = 0. Indeed, if 
there were such a term, the transformation properties under W would demand 
that there be such a term with p + p in the closure of the Weyl chamber, i.e. 

pipe A:=LnO 

where 

D = D s = {A e E\(X,(t>) > V</>eA+} 

and E = denotes the space of real linear combinations of the roots. But we 
claim that 

p -< A, (p + p, p + p) = (A + p, A + p), & p + p e A => p = A. 

Indeed, write p = X — n, n = Y1 k a a, k a > so 

= {X + p,X + p) - (p + p,p + p) 

= (X + p, X + p) - (A + p — 7r, A + p - it) 
= (A + p,ir) + (n,p + p) 

> (A + p, 7r) since p + p e A 

> 

since A + p e A and in fact lies in the interior of D. But the last inequality is 
strict unless tt = 0. Hence it = 0. We will have occasion to use this type of 
argument several times again in the future. In any event we have derived the 
fundamental formula 

<Zch Irr(A) = ]T (-l) w e(w(X + p)). (7.17) 

wew 
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Notice that if we take A = and so the trivial representation with character 
1 for V{\), (7.17) becomes 

q = Y,{-^) w e{wp) 
and this is precisely the denominator in the Weyl character formula: 



WPF ch IWMreMA + p)) 

WCF chMA) = Y-^YMM (? - 18) 



7.5 The Weyl dimension formula. 

For any weight, ji we define 

Then we can write the Weyl character formula as 



ch 



Irr(A) 



For any weight /x define the homomorphism W M from the ring Zfi n (L) into 
the ring of formal power series in one variable t by the formula 

*„(e(i/)) - e^-' 

(and extend linearly) . The left hand side of the Weyl character formula belongs 
to Zfi n (L), and hence so does the right hand side which is a quotient of two 
elements of Zg n (L). Therefore for any /i we have 

MA,) - *„(^ M ) (7.19) 
for any pair of weights. Indeed, 

- ^(_ 1) - e (M,w) Kt 
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In particular, 



= *a(«) 



* A (j](e(a/2)-e(-a/2))) 

n( 



e (X, a ) K t/2 _ e -(X,a)„t/2)\ 



Q^[(A, a) K ^ + terms of higher degree in t. 



Hence 

iw v, ^ *p(^+ P ) n( A + ^ a )K . i , ... , 

W p (ch Irr m) = = — h terms ot positive degree m t. 

y p ( A p) 1 !(/>,«)« 

Now consider the composite homomorphism: first apply and then set t = 
0. This has the effect of replacing every e(p) by the constant 1. Hence applied to 
the left hand side of the Weyl character formula this gives the dimension of the 
representation Irr(A). The previous equation shows that when this composite 
homomorphism is applied to the right hand side of the Weyl character formula, 
we get the right hand side of the Weyl dimension formula: 

dimIrr(A)= n "^ +(A + P f )K - (7-20) 



7.6 The Kostant multiplicity formula. 

Let us multiply the fundamental equation (7.17) by pe{—p) and use the fact 
(7.16) that qpe(-p) = 1 to obtain 

chirr (A) = ^(-l)>NeWA+p)). 

wew 

But 

pe(-p)e(w(\ + p)) = p(- - w(X + p)+ p) 

or, in more pedestrian terms, the left hand side of this equation has, as its 
coefficient of e(p) the value 

p(p + p- w(\ + p)). 

On the other hand, by definition, 

chirr(A) = ^dim(Irr(A) p e(^). 

We thus obtain Kostant 's formula for the multiplicity of a weight \x in the 
irreducible module with highest weight A: 

KMF dim(Irr(A))„= ^ (-l) w p(^ + p-w(\ + p)). (7.21) 

wew 
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It will be convenient to introduce some notation which simplifies the appearance 
of the Kostant multiplicity formula: For w E W and p E L (or in E for that 
matter) define 

w p := w(p + p) - p. (7.22) 

This defines another action of W on E where the "origin of the orthogonal 
transformations w has been shifted from to —p". Then we can rewrite the 
Kostant multiplicity formula as 

dim(Irr(A)) M = £ {-l) w P K {w Q X - fx) (7.23) 

or as 

ch(Irr(A)) = Y,(-^ Wp K( w A ~ (7.24) 

w£W p 

where Pk is the original Kostant partition function. 

For the purposes of the next section it will be useful to record the following 
lemma: 

Lemma 14 If u is a dominant weight and e ^ w e W then w v is not 
dominant. 

Proof. If v is dominant, so lies in the closure of the positive Weyl chamber, 
then v + p lies in the interior of the positive Weyl chamber. Hence if w ^ e, 
then w{v + p)(hi) < for some i, and so w v — w(u + p) — p is not dominant. 
QED 

7.7 Steinberg's formula. 

Suppose that A' and A" are dominant integral weights. Decompose Irr(A') 
Irr(A") into irreducibles, and let n(X) = n(A, A' A") denote the multiplicity of 
Irr(A) in this decomposition into irreducibles (with n(X) = if Irr(A) does not 
appear as a summand in the decomposition). In particular, n{y) = if v is not 
a dominant weight since Irr(^) is infinite dimensional in this case, so can not 
appear as a summand in the decomposition. In terms of characters, we have 

ch(Irr(A')) ch(Irr(A")) = ^ n(X) ch(Irr(A)). 

A 

Steinberg's formula is a formula for n(X). To derive it, use the Weyl character 
formula 

ch(Irr(A")) = ^±£, ch(Irr(A)) = ^ 
in the above formula to obtain 

ch(hv(X'))A x „ +p = J2n(X)A x+p . 

A 
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Use the Kostant multiplicity formula (7.24) for A': 

ch(Irr(A'))= ]T ^(-irPx(™OA'- M )e(M) 
wew fj. 

and the definition 

A X „ +P = £(-l)»e(«(A" + p)) 

u£W 

and the similar expression for A\ +p to get 

E (-ir^(^0A / -M))e(n(A" + p)+ M ) = 

EI>(A)(-ireHA + p)). 

A uj 

Let us make a change of variables on the right hand side, writing 

v = w(d X 

so the right hand side becomes 

X^-irn^Oi/Miz + p). 

If is a dominant weight, then by Lemma 14 w _1 0^ is not dominant if w^ 1 ^ e. 
So n(w^ 1 i/) = if w ^ 1 and so the coefficient of e(u + p) is precisely n(v) 
when v is dominant. 

On the left hand side let 

[i = v - u A" 

to obtain 

E (-1)"™Pk(w A' + u A" - v)e(v + p). 

Comparing coefficients for v dominant gives 

n(u) = J2(- l ) UWp K(w QX' + uQX" - v). (7.25) 

u.w 

7.8 The Freudenthal - de Vries formula. 

We return to the study of a semi-simple Lie algebra g and get a refinement of 
the Weyl dimension formula by looking at the next order term in the expansion 
we used to derive the Weyl dimension formula from the Weyl character formula. 
By definition, the Killing form restricted to the Cartan subalgebra h is given 

by 

n(h,ti) = ^a{h)a{ti) 
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where the sum is over all roots. If /i, A e h* with t^,t\ the elements of H 
corresponding to them under the Killing form, we have 

a 



so 



For each A in the weight lattice L we have let e(A) denote the "formal 
exponential" so Zf in (L) is the space spanned by the e(A) and we have defined 
the homomorphism 

*„ : Z /in (A) - C[[t]], e(A)~e( A -"K 

Let N and D be the images under \t p of the Weyl numerator and denominator. 
So 

N = * p (A p+x ) = Hf p+x (A p ) 

by (7.19) and 

A p = q= [] (e a/2 -er^ 2 ) (7.27) 

and therefore 

7V(t) = JJ ^ e (A+p,a) K i/2 _ e -(A+p, Q ) K t/2^ 
a>0 



with a similar formula for D. Then N/D — > rf(A) = the dimension of the 
representation as t — > is the usual proof (that we reproduced above) of the 
Weyl dimension formula. Sticking this in to N/D gives 

^ = d(A) ( 1 + 1 £[(A + P, ") 2 K - (p, «)^ 2 + ■■■). 

V a>0 / 

For any weight \x we have (p,, /u) re = XXMj a ) 2 by (7.26), where the sum is over 
all roots so 

^ = d + ^[(A + p, A + p) K - (p, p) K ]t 2 + • 

and we recognize the coefficient of jgi 2 in the above expression as XA(Cas K )), 
the scalar giving the value of the Casimir associated to the Killing form in the 
representation with highest weight A. 

On the other hand, the image under \I> p of the character of the irreducible 
representation with highest weight A is 

E eMKt - E(! + p)j + \^ p)l t2 + ■■■) 
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where the sum is over all weights in the irreducible representation counted with 
multiplicity. Comparing coefficients gives 



£(M,p)2=^d(A) XA (Cas'«). 



24 

Applied to the adjoint representation the left hand side becomes (p,p) K by 
(7.26), while d(X) is the dimension of the Lie algebra. On the other hand, 
XA(Cas K ) = 1 since trad(Cas K ) = dim(g) by the definition of Cas K . So we get 

(p,p)K-^dimg (7.28) 

for any semisimple Lie algebra g. 

An algebra which is the direct sum a commutative Lie and a semi-simple 
Lie algebra is called reductive. The previous result of Freudenthal and deVries 
has been generalized by Kostant from a semi-simple Lie algebra to all reductive 
Lie algebras: Suppose that g is merely reductive, and that we have chosen a 
symmtric bilinear form on g which is invariant under the adjoint representation, 
and denote the associated Casimir clement by Cas g . We claim that (7.28) 
generalizes to 

ltrad(Cas g ) = (M- (7-29) 

(Notice that if g is semisimple and we take our symmetric bilinear form to be 
the Killing form ( , ) K (7.29) becomes (7.28).) To prove (7.29) observe that 
both sides decompose into sums as we decompose g into as sum of its center 
and its simple ideals, since this must be an orthogonal decomposition for our 
invariant scalar product. The contribution of the center is zero on both sides, 
so we are reduced to proving (7.29) for a simple algebra. Then our symmetric 
biinear form ( , ) must be a scalar multiple of the Killing form: 

(, )=c 2 (, )« 

for some non-zero scalar c. If zi, . . . , zn is an orthonormal basis of g for ( , ) K 
then zi/c, . . . , zn/c is an orthonormal basis for ( , ). Thus 

Cas e — —77 Cas 
s c 2 

So | 

trad(CaSg) = trad(Cas K ) = — —dim g. 

But on h* we have the dual relation 

(p,p) = \{p,p)k- 

Combining the last two equations shows that (7.29) becomes (7.28). 
Notice that the same proof shows that we can generalize (7.8) as 

X A (Cas) = (A + p,A + p)-(p,p) (7.30) 

valid for any reductive Lie algebra equipped with a symmetric bilinear form 
invariant under the adjoint representation. 
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7.9 Fundamental representations. 



We let oJi denote the weight which satisfies 



Ui(hj) = S lj 



so that the u>i form an integral basis of L and are dominant. We call these 
the basic weights. If (V, p) and (W, a) are two finite dimensional irreducible 
representations with highest weights A and a, then V ® W, p (£> a contains the 
irreducible representation with highest weight A + /i, and highest weight vector 
v\ <g> w^, the tensor product of the highest weight vectors in V and W. Tak- 
ing this "highest" component in the tensor product is known as the Cartan 
product of the two irreducible representations. 

Let (Vi,pi) be the irreducible representations corresponding to the basic 
weight u>i. Then every finite dimensional irreducible representation of g can be 
obtained by Cartan products from these, and for that reason they are called the 
fundamental representations. 

For the case of A n = sl(n + 1) we have already verified that the fundamental 
representations are l\ k (V) where V = C n+1 and where the basic weights are 



We now sketch the results for the other classical simple algebras, leaving the 
details as an exercise in the use of the Weyl dimension formula. 

For C n = sp(2n) it is immediate to check that these same expressions give the 
basic weights. However while V = C 2 " = A 1 (V r ) is irreducible, the higher order 
exterior powers are not: Indeed, the symplectic form £1 G A 2 (V*) is preserved, 
and hence so is the the map 



given by contraction by Ct. It is easy to check that the image of this map 
is surjective (for j = 2, ...,n). the kernel is thus an invariant subspace of 
dimension 



and a (not completely trivial) application of the Weyl dimension formula will 
show that these are indeed the dimensions of the irreducible representations 
with highest weight ojj . Thus these kernels are the fundamental representations 
of C n . Here are some of the details: 



i = L\ + ■ ■ ■ + Li 



A j {V) -» A J - 2 (y) 




We have 




The most general dominant weight is of the form 




n 
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where 

oi = ki H h fc n , a 2 = k 2 -\ 1- k n , ■ ■ ■ a n = k n 

where the ki are non-negative integers. So we can equally well use any decreasing 
sequence Oi > a 2 > ■ ■ ■ > a n > of integers to parameterize the irreducible 
representations. We have 

(p, Li - Lj) =j-i, 0, Li + Lj) = 2n+2-i- j. 

Multiplying these all together gives the denominator in the Weyl dimension 
formula. 

Similarly the numerator becomes 

Uik-wUih+ij) 

i<j i<j. 

where 

li := Oi + n — i + 1. 
If we set rrii := n — i + 1 then we can write the Weyl dimension formula as 

I 2 -I 2 I- 

dim V(ai, . . . ,a n ) = TT — \ ^ TT — , 

±1 - mf — m'- ±A - rrii 

i<j 1 J i 

where for the case i = j we have taken out a common factor of 2™ from the 
numerator and the denominator. 
An easy induction shows that 

JJ(m? -mf)Y[ mi = (2n- l)!(2n - 3)! - •• 1!. 

i<j i 

so if we set 

n = li — 1 = a + n — i 

then 

dim V(ai,...,a„) = 



U i<J (n-r J )(r l + r ] +2) Uiin + l) 



(2n-l)!(2n-3)!---l! 

For example, suppose we want to compute the dimension of the fundamental 
representation corresponding to A2 = L\ + L 2 so Oi = a 2 = 1, Oj = 0, i > 2. In 
applying the preceding formula, all of the terms with 2 < i are the same as for 
the trivial representation, as is r\ — r 2 . The ratios of the remaining factors to 
those of the trivial representation are 

n . n n 

TT— TT — = TT — 
11 7-1 11 j -2 11 7 2 

j=3 J j=3 J j=3 J 

coming from the r, — rj terms, i = 1,2. Similarly the + rj terms give a factor 

2n + 1 -A- 2n + 2 - j 



in 



2n-l- LJ - 2n-j 

3=3 J 
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and the terms n + 1, T2 + 1 contribute a factor 

n+ 1 
n- 1' 

In multiplying all of these terms together there is a huge cancellation and what 
is left for the dimension of this fundamental representation is 

(2n + l)(2n- 2) 
2 ' 

Notice that this equals 

( 2 2 n ) -l=dim A 2 1/-1. 

More generally this dimension argument will show that the fundamental repre- 
sentations are the kernels of the contraction maps : /\ k — > (V) A fe ~ 2 (V) 
where f2 is the symplectic form. 

For B n it is easy to check that u>i := L\ + ■ ■ ■ + Li (i < n — 1), and cj n — 
\{L\ + • • • + L n ) arc the basic weights and the Weyl dimension formula gives 

the value i^™^ ^ for j < n — 1 as the dimensions of the irreducibles with 

these weight, so that they are A J (V), j = 1, . . . n — 1 while the dimension of the 
irreducible corresponding to ui n is 2™. This is the spin representation which we 
will study later. 

Finally, for D n = o(2n) the basic weights are 

Uj = Li H \- Lj, j < n — 2, 

and 

w„_i := H h i„_! + L„) and u n := ^(L x H h L„-i - L„). 

The Weyl dimension formula shows that the the first n — 2 fundamental repre- 
sentations are in fact the representation on A j (V), j = 1, . . . ,n — 2 while the 
last two have dimension 2 n_1 . These are the half spin representations which we 
will also study later. 



7.10 Equal rank subgroups. 

In this section we present a generalization of the Weyl character formula due 
to Ramond-Gross-Kostant-Stcrnberg. It depends on an interpretation of the 
Weyl denominator in terms of the spin representation of the orthogonal group 
(9(g/h), and so on some results which we will prove in Chapter IX. But its 
logical place is in this chapter. So we will quote the results that we will need. 
You might prefer to read this section after Chapter IX. 
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Let p be an even dimensional space with a symmetric bilinear such that 

P = P+ BP- 
is a direct sum decomposition of p into two isotropic subspaces. In other words 
p + and p are each half the dimension of p, and the scalar product of any two 
vectors in p + vanishes, as does the scalar product of any two elements of p . 
For example, we might take p = n + © n_ and the symmetric bilinear form to 
be the Killing form. Then p± — n± is such a desired decomposition. 

The symmetric bilinear form then puts p± into duality, i.e. we may identify 
p with p* + and vice versa. Suppose that we have a commutative Lie algebra 
h acting on p as infinitesimal isometries, so as to preserve each p±, that the e+ 
are weight vectors corresponding to weights fa and that the e~ form the dual 
basis, corresponding to the negative of these weights — fa. In particular, we have 
a Lie algebra homomorphism v from h to o(p), and the two spin representations 
of o(p) give two representations of h. By abuse of language, let us denote these 
two representations by Spin + „ and Spin_„. We can also consider the characters 
of these representations of h. According to equation (9.22) (to be proved in 
Chapter IX) we have 

ch Spin+u - c\\ S p in _ v = Y[ ( e (^A>) _ e (-\P]) ) • 
3 

In the case that h is the Cartan subalgcbra of a semi-simple Lie algebra and 
and p± = n± we recognize this expression as the Weyl denominator. 

Now let g be a semi-simple Lie algebra and r C g a reductive subalgebra of 
the same rank. This means that we can choose a Cartan subalgebra of g which 
is also a Cartan subalgebra of r. The roots of r form a subset of the roots of g. 
The Weyl group W s acts simply transitively on the Weyl chambers of g each of 
which is contained in a Weyl chamber for r. We choose a positive root system 
for g, which then determines a positive root system for r, and the positive Weyl 
chamber for g is contained in the positive Weyl chamber for r. 

Let 

CcW g 

denote the set of those elements of the Weyl group of g which map the positive 
Weyl chamber of g into the positive Weyl chamber for r. By the simple transi- 
tivity of the Weyl group actions on chambers, we know that elements of C form 
coset representatives for the subgroup W r C W & . In particular, the number of 
elements of C is the same as the index of W r in W s . 
Let 

p s and p r 

denote half the sum of the positive roots of g and r respectively. For any 
dominant weight A of g the weight A + p g lies in the interior of the positive 
Weyl chamber for g. Hence for each c e C, the clement c(A + p g ) lies in the 
interior for r and hence 

c • A := c(A + p s ) — p r 
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is a dominant weight for r, and each of these is distinct. 

Let V\ denote the irreducible representation of g with highest weight A. We 
can consider it as a representation of the subalgebra r. Also the Killing form 
(or more generally any ad invariant symmetric bilinear form) on g induces an 
invariant form on r. Let p denote the orthogonal complement of r in g. We 
thus get a homomorphism of r into the orthogonal algebra o(g/r), which is an 
even dimensional orthogonal algebra, and hence has two spin representations. 
To specify which of these two spin representations we shall denote by 5+ and 
which by S- , we note that there is a one dimensional weight space with weight 
p s — p r , and we let S+ denote the spin representation which contains that one 
dimensional space. The spaces S± are o(g/r) modules, and via the homomor- 
phism r — > o(g/r) we can consider them as r modules. 

Finally, for any dominant integral weight p of r we let U t , denote the irre- 
ducible module of r with highest weight p. 

With all this notation we can now state 

Theorem 16 [G-K-R-S] In the representation ring R(r) we have 

V x ®S + ~V x ®S- = Y / (-l) c Uc.x- (7.31) 

Proof. To say that the above equation holds in the representation ring of r 
means that when we take the signed sums of the characters of the representations 
occurring on both sides we get equality. In the special case that r = h, we have 
observed that (7.31) is just the Weyl character formula: 

X (Irr(A)(x(S +g/h )-x(S_ g/h ))= £ (-l) w e(w(X + p s )). 

wew e 

The general case follows from this special case by dividing both sides of this 
equation by x(S+r/h) — x(S- r /h)- The left hand side becomes the character of 
the left hand side of (7.31) because the weights that go into this quotient via 
(9.22) are exactly those roots of g which are not roots of r. The right hand side 
becomes the character of the right hand side of (9.22) by reorganizing the sum 
and using the Weyl character formula for r. QED 
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Chapter 8 

Serre's theorem. 



We have classified all the possibilities for an irreducible Cartan matrix via the 
classification of the possible Dynkin diagrams. The four major series in our clas- 
sification correspond to the classical simple algebras we introduced in Chapter 
III. The remaining five cases also correspond to simple algebras - the "excep- 
tional algebras" . Each deserves a discussion on its own. However a theorem of 
Serre guarantees that starting with any Cartan matrix, there is a corresponding 
semi-simple Lie algebra. Any root system gives rise to a Cartan matrix. So even 
before studying each of the simple algebras in detail, we know in advance that 
they exist, provided that we know that the corresponding root system exists. 
We present Serre's theorem in this chapter. At the end of the chapter we show 
that each of the exceptional root systems exists. This then proves the existence 
of the exceptional simple Lie algebras. 

8.1 The Serre relations. 

Recall that if a and fl are roots, 

{P,a} := 2- r 

(a, a) 

and the string of roots of the form + ja is unbroken and extends from 

— ra to + qa where r — q=(P,a). 
In particular, if a, p e A so that — a is not a root, the string is 

P,P + a,...,0 + qa 

where 

q=-{p,a). 

Thus 

(ade tt )-<^+% = 0, 
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for e«eg , e^e gp but 

(ad e a ) k e p ^ for < k <-(/?, a), 
if e a 7^ 0, e/3 ^ 0. So if A = {at\, . . . , ag} we may choose 

^ Sail /i ^ S— cti 

so that 

ei, • • • , , et, fi,...,f e 

generate the algebra and 



[hi,hj] = 0, l<i,j,<£ (8.1) 

[eufi] = hi (8.2) 

[ajj] = ijtj (8.3) 

[hi,ej] = (a J -,a i )e j (8.4) 

[ft*,/?] = -{a h ai)fi (8.5) 

(ad e,)- <Q - Ql>+1 e, = i^j (8.6) 

(ad fi)-( a » ai)+1 fj = t^j. (8.7) 



Serre's theorem says that this is a presentation of a (semi-)simple Lie algebra. 
In particular, the Cartan matrix gives a presentation of a simple Lie algebra, 
showing that for every Dynkin diagram there exists a unique simple Lie algebra. 

8.2 The first five relations. 

Let f be the free Lie algebra on 3£ generators, X\, . . . , Xg, Y\, . . . , Ye, Z\, . . . , Zi. 
If g is a semi-simple Lie algebra with generators and relations (8.1)-(8.7), we 
have a unique homomorphism f — > g where Xi — > e j , Yj — > fi , Zi — > /ij . We 
want to consider an intermediate algebra, m, where we make use of all but the 
last two sets of relations. So let I be the ideal in f generated by the elements 

\Zi,Z 3 \, [Xi,Yj] - SijZi, \Zi,X 3 \ - { aj ,ai)Xj, [Z h Y 3 ] + (a 3 ,a t )Y r 

We let m := f /I and denote the image of Xi in m by Xi etc. 

We will first exhibit m as Lie subalgebra of the algebra of endomorphisms of 
a vector space. This will allow us to conclude that the x%, y 3 and Zk are linearly 
independent and from this deduce the structure of m. We will then find that 
there is a homomorphism of m onto our desired semi-simple Lie algebra sending 
x 1 ► c , y 1 — ► f, z 1 ► h. 

So consider a vector space with basis v\ , . . . , vg and let A be the tensor 
algebra over this vector space. We drop the tensor product signs in the algebra, 
so write 

Vi x v i2 ■■■v it —v^ ®---v it 
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for any finite sequence of integers with values from 1 to I. We make A into an 
f module as follows: We let the Zi act as derivations of A, determined by its 
actions on generators by 

Zil = 0, ZjVi = -{oti, cej}vj. 

So if we define 
we have 

z j(vn ■■■v it ) = -(c nj + ■ ■ ■ + c H3 ){v tl ■■■v it ). 

The action of the Zi is diagonal in this basis, so their actions commute. We let 
the Yi act by left multiplication by Vi. So 

YjV ix ■■■v it := VjV it ■■■v lt 

and hence 

[Zi,Yj] = -cjiYj = -{a h ai)Yj 

as desired. We now want to define the action of the Xi so that the relations 
analogous to (8.2) and (8.3) hold. Since Zil = these relations will hold when 
applied to the element 1 if we set 

Xjl=0 Vj 

and 

XjVi = 

Suppose we define 

Xj{v p V q ) = SjpCgjVq. 

Then 

ZiX j(VpVq} ^jpCqjCqiVq — CqiXj(VpVq} 

while 

XjZi(VpV q ) = 8jpC qj {c p i + C qi )Vq = -(c pi + C qi )Xj (VjV q ) . 

Thus 

[Zi,Xj](v p v q ) = CjiXj(v p v g ) 

as desired. 

In general, define 

x j( v Pi ■ ■ ■ v Pt) : = v pi ( x j( v P2 ■■■VpJ)- S pij( c P2j + ■■■ + c Pt3 )(v P2 ■■■v pt ) (8.8) 
for t > 2. We claim that 

ZiXj [Vp l • • • Vp t *) { c pii ~t~ ' ' ' ~t~ Cpti Cji)Xj('Vp 1 • • • V pt ) . 

Indeed, we have verified this for the case t = 2. By induction, we may assume 
that Xj(v P2 ■ ■ ■ v Pt ) is an eigenvector of Zi with eigenvalue c P2 i + ■ ■ ■ + c Pti — 
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cji. Multiplying this on the left by v Pl produces the first term on the right of 
(8.8). On the other hand, this multiplication produces an eigenvector of Z; t with 
eigenvalue c pii + • • • + c Pti — Cji. As for the second term on the right of (8.8), if 
j =/= pi it does not appear. If j = p\ then c pi i + - ■ -+c Pt i—Cji = c P2 i + - ■ -+c Pt i. So 
in either case, the right hand side of (8.8) is an eigenvector of Zi with eigenvalue 
c Pli H h c Pti - c^. But then 



as desired. We have defined an action of f on A whose kernel contains I, hence 
descends to an action of m on A. 

Let <p : m — > End A denote this action. Suppose that z := a,\Z\ + • • • + a^zg 
for some complex numbers a\, . . . , at and that <j>(z) = 0. The operator <p(z) has 
eigenvalues 



when acting on the subspace V of A. All of these must be zero. But the Cartan 
matrix is non-singular. Hence all the otj = 0. This shows that the space spanned 
by the Zi is in fact f-dimensional and spans an ^-dimensional abelian subalgebra 
of m. Call this subalgebra z. 

Now consider the 3f-dimensional subspace of f spanned by the Xi,Yi and 
Zi, i — 1, . . . ,£. We wish to show that it projects onto a 3£ dimensional subspace 
of m under the natural passage to the quotient f — > m = f/i. The image of this 
subspace is spanned by Xi,yi and z,. Since (f>(xi) ^ and <p(yi) ^ we know 
that Xi ^ and yi ^ 0. Suppose we had a linear relation of the form 



Choose some z' G z such that cti(z') ^ and cei(z') ^ <^j{z') for any i =/= j. 
This is possible since the on are all linearly independent. Bracketing the above 
equation by z' gives 



by the relations (8.4) and (8.5). Repeated bracketing by z' and using the van 
der Monde (or induction) argument shows that <Zj = 0, &j = and hence that 



We have proved that the elements Xi,yj,Zk in m are linearly independent. 
The clement 



[Zi,Xj] = (aj,OLi)Xj 






z = 0. 




is an eigenvector of Zi with eigenvalue 



H h Ci t i- 



For any pair of elements fi and A of z* (or of h*) recall that 



li < A 
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denotes the fact that A — ji = ^2 ^i a i where the ki are all non-negative integers. 
For any A £ z* let mj denote the set of all m € m satisfying 

[z,m] = X(z)m Vz e z. 

Then we have shown that the subalgcbra x of m generated by X\,...,xe is 
contained in 

m+ := 0m A . 

O^A 

Similarly, the subalgebra y of m generated by the yi lies in 

m := 0m A . 

A^O 

In particular, the vector space sum 

y + z + x 

is direct since z C mo. We claim that this is in fact all of m. First of all, observe 
that it is a subalgebra. Indeed, [yi, xj] — —SijZi lies in this subspace, and hence 

[yi, [x h ,[- ■ ■ [x jt -i,x jt ] ■ ■ ■} e x for t > 2. 

Thus the subspace y + z + x is closed under ad yi and hence under any product 
of these operators. Similarly for ad Xi. Since these generate the algebra m we 
see that y + z + x = m and hence 

x = m + and y = m . 

We have shown that 

m = m © z © m + 

where z is an abelian subalgebra of dimension I, where the subalgebra m + is 
generated by x\, . . . , xg, where the subalgebra m_ is generated by y\, . . . , yi, and 
where the 3£ elements x\,. . . ,xi,y\. . . ,yt,z\,. . . ,zt are linearly independent. 

There is a further property of m which we want to use in the next section in 
the proof of Serre's theorem. For all i ^ j between 1 and I define the elements 
Xij and yij by 

Xij := (ad Xi)- c i' +1 (xj), y^ := (ad yi)~ Cii+1 (yj). 

Conditions (8.6) and (8.7) amount to setting these elements, and hence the ideal 
that they generate equal to zero. We claim that for all k and all i ^ j between 
1 and I we have 

ad xkiyij) = (8.9) 



and 



ad yk{xij) = 0. 



(8.10) 
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By symmetry, it is enough to prove the first of these equations. If k ^ i then 
[xk, Vi] = by (8.3) and hence 

ad Xk(yij) = (ad yi)~ Cji+1 [x k , yj] = (ad yi)~ Cii+1 S k jhj 

by (8.2) and (8.3). If k ^ j this is zero. If k = j we can write this as 

(ad yi y c ^(&d y t )hj = (ad yi)~ Cii c^yi. 

If Cij = there is nothing to prove. If ^ then Cji ^ and in fact is strictly 
negative since the angles between all elements of a base are obtuse. But then 

(ad y l )~ c ^y i = 0. 

It remains to consider the case where k = i. The algebra generated by Xi, yzi 
is isomorphic to sl(2) with [x^j/i] = z i: [2^,2^] = 2x i} [zi,yi\ = —2 yi . We have 
a decomposition of m into weight spaces for all of z, in particular into weight 
spaces for this little sl(2). Now [xi, yj] = (from (8.3)) so yj is a maximal weight 
vector for this si (2) with weight — cji and (8.9) is just a standard property of a 
maximal weight module for sl(2) with non-negative integer maximal weight. 

8.3 Proof of Serre's theorem. 

Let k be the ideal of m generated by the Xij and y t j as defined above. We wish 
to show that 

g := m/k 

is a semi-simple Lie algebra with Cartan subalgebra h = z/k and root system 
<E>. For this purpose, let i now denote the ideal in m + generated by the x^ and 
j be the ideal in m_ generated by the y^ so that 

i + j c k. 

We claim that j is an ideal of m. Indeed, each y^ is a weight vector for 
z, and [z,m_] C m , hence [z,j] C j. On the other hand, we know that 
[xfe, m_] C m_ + z and [x k , yij] = by (8.9). So (ad Xk)j C j by Jacobi. Since 
the Xk generate m + Jacobi then implies that [m + , j] C j as well, hence j is an 
ideal of m. Similarly, i is an ideal of m. Hence i + j is an ideal of m, and since 
it contains the generators of k, it must coincide with k, i.e. 

k = i+j. 

In particular, ziHk = {0} and so z projects isomorphically onto an ^-dimensional 
abelian subalgebra of g = m/k. Furthermore, since j D m + = {0} and i n m_ = 
{0} we have 

g = n_®h®n+ (8.11) 

as a vector space where 



n = 



m_/j, and n + = m + /i, 
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and n + is a a sum of weight spaces of h, summed over A >- while n_ is a sum 
of weight spaces of h with A -< 0. We have to see which weight spaces survive 
the passage to the quotient. The sl(2) generated by Xi, y%, Zi is not sent into zero 
by the projection of m onto g since Zi is not sent into zero. Since sl(2) is simple, 
this means that the projection map is an isomorphism when restricted to this 
sl(2). Let us denote the images of Xi,yi, Zi by e i7 fi, hi. Thus g is generated by 
the 3£ elements 

ei, • • • , ee, fi, ■ ■ ■ , fe, hi, . . . , he 

and all the axioms (8.1)-(8.7) are satisfied. 

We must show that g is finite dimensional, semi-simple, and has $ as its 
root system. 

First observe that ad acts nilpotently on each of the generators of the 
algebra g, and hence acts locally nilpotently on all of g. Similarly for ad fi. 
Hence the automorphism 

n := (expad a) (ad - /;)(expad e,) 

is well defined on all of g. So if Si denotes the reflection in the Weyl group W 
corresponding to i, we have 

n(g\) = g Ss A- 

Notice that each of the is finite dimensional, since the dimension of for 
A ^ is at most the number of ways to write A as a sum of successive Qj, each 
such sum corresponding to the element [x il , [x i2 , [• • • ,x it ] ■■■]. (In particular 
m/ca = {0} for k > 1.) Similarly for A -< 0. So it follows that each of the g\ is 
finite dimensional, that 

dimg^A = dimgA Vw E W 

and that 

g kx = for k+ -1,0,1. 

Furthermore, g Qi is one dimensional, and since every root is conjugate to a 
simple root, we conclude that 

dimg Q = 1 Va€$. 

We now show that 

g A = {0} forA^O, A^$. 

Indeed, suppose that gA ^ {0}. We know that A is not a multiple of a for any 
q€$, since we know this to be true for simple roots, and the dimensions of the 
gA are invariant under the Weyl group, each root being conjugate to a simple 
root. So X 1 - does not coincide with any hyperplane orthogonal to any root. So 
we can find a/ie) 1 such that (a, fi) ^ for all roots. We may find a w <E W 
which maps /x into the positive Weyl chamber for A so that (aj,/i) > and 
hence (a*, wfi) > for i = 1, . . . , I. Now 

dimg„,A = dimgA 
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and for the latter to be non-zero, we must have 

wA = kiOii 

with the coefficients all non-negative or non-positive integers. But 
= (A, m) = OA, io/x) = k *( a '" V) 

with (a i7 fi) > Vi. Hence all the ki — 0. 
So 

dimg = t + Card $. 

We conclude the proof if we show that g is semi-simple, i.e. contains no abclian 
ideals. So suppose that a is an abelian ideal. Since a is an ideal, it is stable 
under h and hence decomposes into weight spaces. If g Q fla ^ {0}, then g Q C a 
and hence [g_ a ,g a ] C a and hence the entire sl(2) generated by g Q and g_ Q 
is contained in a which is impossible since a is abelian and sl(2) is simple. So 
a C h. But then a must be annihilated by all the roots, which implies that 
a = {0} since the roots span h*. QED 



8.4 The existence of the exceptional root sys- 
tems. 

The idea of the construction is as follows. For each Dynkin diagram we will 
chose a lattice L in a Euclidean space V, and then let $ consist of all vectors in 
this lattice having all the same length, or having one of two prescribed lengths. 
We then check that 

2{ct\ , ao) „ , , 
{ai, ai) 

This implies that reflection through the hyperplane orthogonal to ot\ preserves 
L, and since reflections preserve length that these reflection s preserve This 
will show that <& is a root system and then calculation shows that it is of the 
desired type. 

l X \ 

(G2). Let V be the plane in R 3 consisting of all vectors y with 

w 

x + y + z = 0. 

Let L be the intersection of the three dimensional standard lattice 1? with V. 
Let Li,L,2,L,3 denote the standard basis of K 3 . Let $ consist of all vectors in 
L of squared length 2 or 6. So $ consists of the six short vectors 



±(Li - Lj) i < j 



8.4. THE EXISTENCE OF THE EXCEPTIONAL ROOT SYSTEMS. 

and the six long vectors 

±{2L t - Lj — L k ) i= 1,2,3 j 56 i, j± k. 
We may choose the base to consist of 

a=L\ — L 2 , a 2 = —2L\ + L 2 + L 3 . 
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F 4 . Let V = M 4 and L = Z 4 + Z(|(Li + L 2 + L 3 + L 4 ). Let $ consist of all 
vectors of L of squared length 1 or 2. So $ consists of the 24 long roots 

±Li ±Lj i< j 

and the 24 short roots 

±L U ^{±L 1 ±L 2 ±L 3 ±L i ). 

For A we may take 

ai = L 2 -L s , a 2 = L 3 -L 4 , a 3 = L 4 , a 4 = -(Li - L 2 - L 3 - L 4 ). 

E 8 . Let V = E 8 . Let 

L' := (^ciLi, CiEZ, ^ c H even}. 

Let 

L := L' + Z(^(Lj + L x + L 2 + L 3 + L 4 + L 5 + L 6 + L 7 + L 8 )). 

Let $ consist of all vectors in L of squared length 2. So $ consists of the 240 
roots 

1 8 

±Li±Lj (i < j), o^Z 1 * 1 ^' ( even number of + signs). 

2 i= i 

For A we may take 

ax — — L\ — L 2 — L 3 — L 4 — L$ — Lq — L7 + Lg) 

a 2 = L\ + L 2 

oti = Lj_i-Li_2 (3<«<8). 

E-j is obtained from E$ by letting V be the span of the first 7«j. £3 is 
obtained from E s by letting V be the span of the first 6 a, . 



Chapter 9 



Clifford algebras and spin 
representations. 

9.1 Definition and basic properties 
9.1.1 Definition. 

Let p be a vector space with a symmetric bilinear form ( , ). The Clifford 
algebra associated to this data is the algebra 

C(p) := T(p)/J 

where T(p) denotes the tensor algebra 

T(p) = fcepe(p<g)p)®--- 

and where / denotes the ideal in T(p) generated by all elements of the form 

2/12/2 + 2/22/1 -2(y 1 ,y 2 )l, 2/1,2/2 £P 

and 1 is the unit element of the tensor algebra. The space p injects as a subspace 
of C(p) and generates C(p) as an algebra. 

A linear map / of p to an associative algebra A with unit 1a is called a 
Clifford map if 

/(2/i)/(2/2) + /(2/2)/(2/i) = 2(2/i, 2/2)U, Vj/i, y 2 £p 

or what amounts to the same thing (by polarization since we are not over a field 
of characteristic 2) if 

f(y) 2 = (y,y)U V y e p. 

Any Clifford map gives rise to a unique algebra homomorphism of C(p) to A 
whose restriction to p is /. The Clifford algebra is "universal" with respect to 
this property. 

If the bilinear form is identically zero, then C(p) = Ap, the exterior algebra. 
But we will be interested in the opposite extreme, the case where the bilinear 
form is non-degenerate. 
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9.1.2 Gradation. 

The ideal I defining the Clifford algebra is not Z homogeneous (unless the 
bilinear form is identically zero) since its generators 2/12/2 + 2/22/1 — 2(2/1, 2/2)1 are 
"mixed" , being a sum of terms of degree two and degree zero in T(p) . But these 
terms are both even. So the Z/2Z gradation is preserved upon passing to the 
quotient. In other words, C(p) is a Z/2Z graded algebra: 

c(p) = c (p)eCi( P ) 

where the elements of Co(p) consist of sums of products of elements of p with 
an even number of factors and d (p) consist of sums of terms each a product of 
elements of p with an odd number of factors. The usual rules for multiplication 
of a graded algebra obtain: 

C (p) • C (p) C C (p), Co( P ) • d(p) C dfo), d(p) • d(p) C C (p). 

9.1.3 Ap as a C(p) module. 

Let p be a vector space with a non-degenerate symmetric bilinear form. The 
exterior algebra, Ap inherits a bilinear form which we continue to denote by 
( , ). Here the spaces A fe (p) and A £ (p) are orthogonal if k ^ I while 

(xi A • • • A Xk,yi A • • • A 2/fe) = det {(x l ,y )) . 

For u e p let e(v) € End(Ap) denote exterior multiplication by v and l(v) be 
the transpose of e(v) relative to this biinear form on Ap. 

So l(v) is interior multiplication by the element of p* corresponding to v 
under the map p — > p* induced by ( , ) p . The map 

p^ End(Ap), v i-> e(v) + l(v) 

is a Clifford map, i.e. satisfies 

(e(v) + i{v)f = (v,v) p id 

and so extends to a homomorphism of 

C(p) -» End Ap 

making Ap into a C(p) module. We let denote the product of x and y in 
C(p). 

9.1.4 Chevalley's linear identification of C(p) with Ap. 

Consider the linear map 

C(p) — > Ap, mil 
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where 1 e A°p under the identification of A°p with the ground field. The 
element il on the extreme right means the image of 1 under the action of 
x G C(p). 

For elements «i,...,«fe€p this map sends 

V\ i — ► Vi 
ViV 2 H-» Wl A «2 + (l>l, 1>2)1 
«lW2«3 !-» «1 Af2 A V 3 + {vi,V 2 )v 3 - {vi,V 3 )v 2 + {v 2 ,V 3 )vi 

V1V2V3V4 i-> ui A v 2 A w 3 A v 4 + (v 2 , v 3 )v\ A v 4 — (v 2 , v±)v\ A v 3 
+ (v 3 ,v 4 )v 1 A f 2 + (wi,w 2 )w3 A w 4 - («!, w 3 )ui A u 4 
+ (vi,v 4 )v 2 Av 3 + (vi,V4,)(v2,v 3 ) - {vi,v 3 )(v 2 ,v 4 ) + {vi,v 2 )(v 3 ,v 4 ) 



If the w's form an "orthonormal" basis of p then the products 
«ii ■■■«»*, ii < «2 ■ ■ ■ < fc = 0,l,...,n 
form a basis of C(p) while the 

Uii A • • • Ai)i k , h<i2--- <ik, fe = 0, l,...,n 
form a basis of Ap, and in fact 

t>i ■ ■ ■ i>fc 1— > i>i A ■ ■ ■ A t>k if (vj, Uj) = Vi 7^ j. (9.1) 

In particular, the map C(p) — + Ap given above is an isomorphism of vector 
spaces, so we may identify C(p) with Ap as a vector space if we choose, and 
then consider that Ap has two products: the Clifford product which we denote 
by juxtaposition and the exterior product which we denote with a A. 

Notice that this identification preserves the Z/2Z gradation, an even element 
of the Clifford algebra is identified with an even element of the exterior algebra 
and an odd clement is identified with an odd element. 



9.1.5 The canonical antiautomorphism. 

The Clifford algebra has a canonical anti-automorphism a which is the identity 
map on p. In particular, for Ujgpwe have a{v\V2) = V2V1, a(viv 2 v 3 ) — v 3 v 2 v\, 
etc. By abuse of language, we use the same letter a to denote the similar anti- 
automorphism on Ap and observe from the above computations (in particular 
from the corresponding choice of bases) that a commutes with our identifying 
map C(p) — » Ap so the notation is consistent. We have 



a= (_i)|fe(fe-i) id on A fe ( p ). 
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For small values of k we have 



k 


(_l)|fe(fe-i) 





1 


1 


1 


2 


-1 


3 


-1 


4 


1 


5 


1 


6 


-1. 



We will use subscripts to denote the homogeneous components of elements 
of Ap. Notice that if u £ A 2 p then au = —u by the above table, while 
a(u 2 ) = (au) 2 = u 2 . Since u 2 is even (and hence has only even homogeneous 
components) and since the maximum degree of the homogeneous component of 
u 2 is 4, we conclude that 

u 2 = (u 2 ) + (w 2 ) 4 VuGA 2 p. (9.2) 

For the same reason 

v 2 = (v 2 ) + (v 2 ) 4 V»£ A 3 p. (9.3) 

We also claim the following: 

(ww')o = (aw,w') = (-l)^ k -^{w,w') yw,w'eA k (p). (9.4) 

Indeed, it is sufficient to verify this for w, w' belonging to a basis of Ap, say the 
basis given by all elements of the form (9.1), in which case both sides of (9.4) 
vanish unless w — w' . If w = w' = v\ A ■ ■ ■ A Vk (say) then 

(Ho = L ( v i) ■ ■ ■ L ( v k)vi A ■ ■ ■ A v k = 
{ -l)^(k-D {vijVi) . . . {vkiVk) = { -i)iHk-D {WiW) 

proving (9.4). 

As special cases that we will use later on, observe that 

(uu')o = -{u,v!) Vu,u'eA 2 p (9.5) 

and 

(vv')o = ~{v,v') V v,v' e A 3 p. (9.6) 

9.1.6 Commutator by an element of p. 

For any y £ p consider the linear map 

w i — ► [y, w] = yw — (—l) wy for w e A fe p 
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which is (anti)commutator in the Clifford multiplication by y. We claim that 

[y,w] = 2t(y)w. (9.7) 

In particular, [y, •], which is automatically a derivation for the Clifford multi- 
plication, is also a derivation for the exterior multiplication. Alternatively, this 
equation says that which is a derivation for the exterior algebra multipli- 
cation, is also a derivation for the Clifford multiplication. 
To prove (9.7) write 

wy = a(ya(w)). 

Then 

yw — y Aw + t(y)w, wy = a(y A a(w)) + a(i(y)aw) — w A y + (ai(y)a)w. 
We may assume that w £ A fc p. Then 

y Aw - (-l) k w A y = 0, 

so we must show that 

ai(y)aw = (-l) fe_1 i(y)w . 
For this we may assume that y ^ and we may write 

w = u A z + z , 

where i(y)u = 1 and t(y)z = t(y)z' = 0. In fact, we may assume that z and z' 
are sums of products of linear elements all of which are orthogonal to y. Then 
t(y)az = i(y)az' = so 

i{y)aw = (-l) fe ~ 1 az 
since z has degree one less than w and hence 

ai(y)aw = {-l) k ^z = (-l) fc_1 i(y)w . QED 

9.1.7 Commutator by an element of A 2 p. 

Suppose that 

u e A 2 p. 

Then for y £ p we have 

[u,y] = -[y,u] = -2i{y)u. (9.8) 

In particular, if u — yi A yj where yi, yj £ p we have 

[u,y] = 2(y J ,y)y l ~2(y il y)y 3 VjiGp. (9.9) 

If (yi, yj) — this is an "infinitesimal rotation" in the plane spanned by yt and 
yj. Since yiAyj, i < j form a basis of A 2 p if t/i, . . . , y n form an "orthonormal" 
basis of p, we see that the map 

u i ► [u, ■] 
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gives an isomorphism of A 2 p with the orthogonal algebra o(p). This identifi- 
cation differs by a factor of two from the identification that we had been using 
earlier. 

Now each clement of o(p) (in fact any linear transformation on p) induces 
a derivation of Ap. We claim that under the above identification of A 2 p with 
o(p), the derivation corresponding to u <G A 2 p is Clifford commutation by u. In 
symbols, if 9 U denotes this induced derivation, we claim that 

9u{w) = [u, w\ — uw — wu Vm£ Ap. (9.10) 

To verify this, it is enough to check it on basis elements of the form (9.1), and 
hence by the derivation property for each Vj, where this reduces to (9.8). 

We can now be more explicit about the degree four component of the Clifford 
square of an element of A 2 p, i.e. the element (w 2 )4 occurring on the right of 
(9.2). We claim that for any three elements y,y',y" € p 

^b{y")b{y')t{y)u 2 = {yAy' ,u)t{y")u+{y' Ay" ,u)i{y)u+{y" Ay,u)t{y')u. (9.11) 
To prove this observe that 

L {y) u2 = {i{y)u)u + u{i{y)u) 
t{y')t{y)u 2 = {i{y')b{y)u) u - b(y)ub{y')u + b{y')ub{y)u + ub{y')b{y)u 
= 2((yAy',u)u+L(y')uAb(y)u) 

h{y")i{y')L{y)u 2 = (y A y' , u)b{y")u + i{y")L{y')u A b(y)u - i{y')u A i{y")t{y)u 

= (y A y', u)b{y")u + (y' A y" , u)b{y)u + (y" A y, u)b{y')u 

as required. 

We can also be explicit about the degree zero component of u 2 . Indeed, it 
follows from (9.9) that if u — yi A y 3 i < j where y\, . . . , y n form an "orthonor- 
mal" basis of p then 

tr(ad p uf = -8{yi,yi){yj,yj), 

where ad p u denotes the (commutator) action of u on p under our identification 
of A 2 p with o(p). But 

{yi A yj,yt a yj) = {yi,yi){yj,yj){= ±1). 
So using (9.5) we see that 

(u 2 ) = I tr(ad p u) 2 = -(u, u) (9.12) 

o 

for u e A 2 p. 
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9.2 Orthogonal action of a Lie algebra. 

Let r be a Lie algebra. Suppose that we have a representation of r acting 
as infinitesimal orthogonal transformations of p which means, in view of the 
identification of A 2 p with o(p) that we have a map 

v : r — > A 2 p 

such that 

x ■ y = -2u{y)u{x) (9.13) 
where x ■ y denotes the action of x € r on y e p. 



9.2.1 Expression for v in terms of dual bases. 

It will be useful for us to write equation (9.13) in terms of a basis. So let 
yi , . . . , y n be a basis of p and let z\ , . . . , z n be the dual basis relative to ( , ) = 
( , ) p . We claim that 



i 

Indeed, it suffices to verify (9.13) for each of the elements Zi. Now 
t ( z *) |"JE% Aj; ' z i ^ = _ \ x ' Zi + \ x ■ z ^Vr 

But 

(,Zi 7 X • Zj^ (x • Zj) 

since x acts as an infinitesimal orthogonal transformation relative to ( , ). So 
we can write the sum as 

— ^^(zj, x ■ Zj)yj = — - ~y^^(x ■ Zi, Zj)yj = ~~^ x ' z > 

3 j 

yielding 



y^ Ax- zj ] = --x ■ Zi 



which is (9.13). 



9.2.2 The adjoint action of a reductive Lie algebra. 

For future use we record here a special case of (9.14): Suppose that p = r = g 
is a reductive Lie algebra with an invariant symmetric bilinear form, and the 
action is the adjoint action, i.e. x ■ y — [x,y]. Let h be a Cartan subalgebra of g 
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and let <J> denote the set of roots and suppose that we have chosen root vectors 
e^, e_0, (f> <G $ so that 

(e0,e_ ) = 1. 

Let hi, . . . , h s be a basis of h and ki,...k s the dual basis. Let 

V> : g -» A 2 g 

be the map 1/ when applied to this adjoint action. Then (9.14) becomes 

^( x ) = \ \ Yl, hiA \ k ii x \ + ^2 e -4> A I e <^ x ] ■ ( 9 - 15 ) 
\*=i </>e* / 

In case i=fceh this formula simplifies. The [ki,h] = 0, and in the second 
sum we have 

e_0 A [e<j>, h] = -</>(h)e-^ A 

which is invariant under the interchange of <f> and —<j>. So let us make a choice 
of positive roots. Then we can write (9.15) as 

= ~\ ^)M Ae f ^h- (9-16) 

Now 

A e0 = -1 + e_^e0. 

So if 

p--=\J2^ ( 9 - 17 ) 

is one half the sum of the positive roots we have 

V>(/i) = p(h) - X - 4>{h)e-^, heh. (9.18) 

<j>£<S>+ 

In this equation, the multiplication on the right is in the Clifford algebra. 

9.3 The spin representations. 

if 

P = pi ffip 2 

is a direct sum decomposition of a vector space p with a symmetric bilinear 
form into two orthogonal subspaces then it follows from the definition of the 
Clifford algebra that 

C(p) = C(pi) ® C(p 2 ) 
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where the multiplication on the tensor product is taken in the sense of superal- 
gebras, that is 

(ai ® a 2 )(&i ® 62) := <Ji&i ® «2&2 
if either a2 or 61 arc even, but 

(ai <8> a 2 )(6i ® 6 2 ) := —a\b\ <g> a 2 &2 

if both a 2 and 61 are odd. It costs a sign to move one odd symbol past another. 

9.3.1 The even dimensional case. 

Suppose that p is even dimensional. If the metric is split (which is always the 
case if the metric is non-degenerate and we are over the complex numbers) then 
p is a direct sum of two dimensional mutually orthogonal split spaces, Wj, so 
let us examine first the case of a two dimensional split space p, spanned by 1, e 
with (t, t) = (e, e) = 0, (1, e) = \. Let T be a one dimensional space with basis 
t and consider the linear map of p — > End (AT) determined by 

e 1 ► e(t), 1 1— ► i(t*) 

where e(t) denotes exterior multiplication by t and t(t*) denotes interior multi- 
plication by t* , the dual element to t in T* . This is a Clifford map since 

e{tf = = i(t*f, e{t)i{f) + b(t*)e(t) = id. 

This therefore extends to a map of C(p) — > End(AT). Explicitly, if we use 
1 e A°T, t g A X T as a basis of AT this map is given by 

' - (0 I) 
-(") 

Lt " (J o)- 

This shows that the map is an isomorphism. If now 

p = Wi$-8W m 
is a direct sum of two dimensional split spaces, and we write 



T = Ti© 
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where the C(Wj) = End(ATj) as above, then since 

AT = ATi <g> ■ ■ ■ <g> AT m 

we see that 

C(p) =End (AT). 

In particular, C(p) is isomorphic to the full 2 m x 2 m matrix algebra and hence 
has a unique (up to isomorphism) irreducible module. One model of this is 

S = AT. 

We can write 

S = S+ © S_ 

as a supervector space, where we choose the standard Z 2 grading on AT to 
determine the grading on S if m is even, but use the opposite grading (for 
reasons which will become apparent in a moment) if m is odd. 

The even part, Co(p) of C(p) acts irreducibly on each of S±. Since A 2 p 
together with the constants generates Co(p) we see that the action of A 2 p on 
each of S± is irreducible. Since A 2 p under Clifford commutation is isomorphic 
to o(p) the two modules S± give irreducible modules for the even orthogonal 
algebra o(p). These are the half spin representations of the even orthogonal 
algebras. 

We can identify S — S + S- as a left ideal in C(p) as follows: Suppose 
that we write 

p = p+ © p 

where p± are complementary isotropic subspaces. Choose a basis e\ , . . . , e+ of 
p + and let 

e+ := e+ • • • e+ = e+ A • • • A e+ e A m p+. 

We have 

y+e+ = 0, V y+ e p+ 

and hence 

(Ap+)+e+ = 0. 

In other words 

Ap+e+ 

consists of all scalar multiples of e+. 
Since 

Ap_ ® Ap + — > C(p), w_ ® w + i ^ w-w + 
is a linear bijection, we see that 

C(p)e + = Ap_e + . 

This means that the left ideal generated by e+ in C(p) has dimension 2 m , and 
hence must be isomorphic as a left C(p) module to S. In particular it is a 
minimal left ideal. 
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Let e 1 ,...e m be a basis of p_ and for any subset J = . . . , ij}, i\ < 
i<i ■ ■ ■ < ij of {1, . . . , to} let 

e - : = e n A ' ' ' A % = e n ■ ■ ■ %■ 

Then the elements 

e J _e + 

form a basis of this model of S as J ranges over all subsets of {1, ... , to}. 

For example, suppose that we have a commutative Lie algebra h acting 
on p as infinitesimal isometries, so as to preserve each p±, that the e\ are 
weight vectors corresponding to weights (3i and that the e~ form the dual basis, 
corresponding to the negative of these weights — Then it follows from (9.14) 
that the image, v(h) € A 2 (p) C C(p) of an element ft. G h is given by 



Thus 
where 



i/(/i)e+ = p P (/i)e+ (9.19) 



P P :=^(/3i + --- + /? m ). (9.20) 



For a subset J of {1, . . . , to} let us set 

(3 j :=Y,fr- 



Then we have 
and so 



[v(h),el] = -0j(h)ei 



v{h)(e J _e+) - [v(h),el]e+ + e J _u(h)e+ = {p p {h) - (i.j{h))e J _e+. 

So if we denote the action of v{h) on S± by Spin± v(h) and the action of v(h) 
on S — S+ S- by Spin v(h) we have proved that 

The e J _e + are weight vectors of Spin v with weights p p — (3j. (9-21) 

It follows from (9.21) that the difference of the characters of Spin + j/ and Spin_^ 
is given by 

ch S p in+v - Chspin.v = ]J ( e (|ft ) - e i-\Pj) ) = e (P P ) II e ("^)) ■ 



(9.22) 

There are two special cases which are of particular importance: First, this 
applies to the case where we take h to be a Cartan subalgebra of o(p) = o(C 2fe ) 
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itself, say the diagonal matrices in the block decomposition of o(p) given by the 
decomposition 

C 2k = c k ®C k 

into two isotropic subspaces. In this case the /?, is just the i-th diagonal entry 
and (9.22) yields the standard formula for the difference of the characters of the 
spin representations of the even orthogonal algebras. 

A second very important case is where we take h to be the Cartan subalgebra 
of a semi- simple Lie algebra g, and take 

p := n + © n_ 

relative to a choice of positive roots. Then the (3j are just the positive roots, 
and we see that the right hand side of (9.22) is just the Weyl denominator, the 
denominator occurring in the Weyl character formula. This means that we can 
write the Weyl character formula as 

ch(Irr(A) ® S+) - ch(Irr(A) ® 5_) = J2 (- 1 )"' e ( w • A ) 

wew 

where 

w • A := w(X + p). 

If we let Ufj, denote the one dimensional module for h given by the weight /i we 
can drop the characters from the preceding equation and simply write the Weyl 
character formula as an equation in virtual representations of h: 

Irr(A) ® S+ - Irr(A) ® S- = ^ (-l) w U wmX - (9.23) 

The reader can now go back to the preceding chapter and to Theorem 16 
where this version of the Weyl character formula has been generalized from the 
Cartan subalgebra to the case of a reductive subalgebra of equal rank. In the 
next chapter we shall sec the meaning of this generalization in terms of the 
Kostant Dirac operator. 

9.3.2 The odd dimensional case. 

Since every odd dimensional space with a non-singular bilinear form can be 
written as a sum of a one dimensional space and an even dimensional space (both 
non-degenerate), we need only look at the Clifford algebra of a one dimensional 
space with a basis element x such that (x, x) = 1 (since we are over the complex 
numbers). This Clifford algebra is two dimensional, spanned by 1 and x with 
x 2 = 1, the element x being odd. This algebra clearly has itself as a canonical 
module under left multiplication and is irreducible as a Z/2Z module. We may 
call this the spin representation of Clifford algebra of a one dimensional space. 
Under the even part of the Clifford algebra (i.e. under the scalars) it splits 
into two isomorphic (one dimensional) spaces corresponding to the basis 1 , x of 
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the Clifford algebra. Relative to this basis 1, x we have the left multiplication 
representation given by 



1 i ► 











u 









Let us use C(C) to denote the Clifford algebra of the one dimensional or- 
thogonal vector space just described, and 5(C) its canonical module. Then 
if 

q = p® C 

is an orthogonal decomposition of an odd dimensional vector space into a direct 
sum of an even dimensional space and a one dimensional space (both non- 
degenerate) we have 

C(q) £* C(p) ® C(C) £* End(5(q)) 

where 

5(q) := 5(p) ® 5(C) 

all tensor products being taken in the sense of superalgebra. We have a decom- 
position 

5(q) = 5+(q)®5_(q) 
as a super vector space where 

5+(q) - 5+(p) ® z5_(p), 5_(q) = 5_(p) ® x5+(p). 

These two spaces are equivalent and irreducible as Co(q) modules. Since the 
even part of the Clifford algebra is generated by A 2 q together with the scalars, 
we see that either of these spaces is a model for the irreducible spin representa- 
tion of o(q) in this odd dimensional case. 

Consider the decomposition p = p + ® p that we used to construct a model 
for 5(p) as being the left ideal in C(p) generated by A m p+ where m = dimp + . 
We have 

A(Cffip_) = A(C) ® Ap_, 

and 

Proposition 28 The left ideal in the Clifford algebra generated by A m p+ is a 
model for the spin representation. 

Notice that this description is valid for both the even and the odd dimensional 
case. 

9.3.3 Spin ad and V p . 

We want to consider the following situation: g is a simple Lie algebra and we 
take ( , ) to be the Killing form. We have 



$ : g -» A 2 g C C(g) 
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which is the map v associated to the adjoint representation of g. Let h be 
a Cartan subalgebra and <& the collection of roots. We choose root vectors 
e^, <p € <& so that 

(e0,e_ ) = 1. 

Then it follows from (9.14) that 

where the brackets are the Lie brackets of g, where the hi range over a basis 
of h and the fcj over a dual basis. This equation simplifies in the special cases 
where x = h € h and in the case where x — e^, ip € <I> + relative to a choice, 
<f> + of positive roots. In the case that x = h e h we have seen that [fej, /i] = 
and the equation simplifies to 

$(h) =p{h)l- 1 - J2 <PWe-*e* (9-25) 

where 

is one half the sum of then positive roots. 
We claim that for ip e <f> + we have 

^^^^^ ( 9 - 26 ) 
where the sum is over pairs (7', ip') such that either 

1. 7' = 0, ip' = ip and Xji £ h or 

2. 7' G ip' e <&+ and 7' + ?/>' = ip, and xy e gy . 

To see this, first observe that this first sum on the right of (9.24) gives 

i A 

and so all these summands are of the form 1). For each summand 

of the second sum, we may assume that either <f> + ip = or that <j> + ip e $ for 
otherwise [e^, e^] = 0. If ^ + ip = 0, so -0 = —<p ^ 0, we have [e^, e,/,] € h which 
is orthogonal to since <p> ^ 0. So 

again has the form 1). 
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If 4> + tjj = t ^ is a root, then (e_«^, e T ) = since </> ^ t. If t € $ + then 

where y T is a multiple of e T so this summand is of the form 2) . If t is a negative 
root, the must be a negative root so —cj) is a positive root, and we can switch 
the order of the factors in the preceding expression at the expense of introducing 
a sign. So again this is of the form 2), completing the proof of (9.26). 

Let n + be the subalgebra of g generated by the positive root vectors and 
similarly n_ the subalgebra generated by the negative root vectors so 

g = n + b_ , b := n _ © h 

is an h stable decomposition of g into a direct sum of the nilradical and its 
opposite Borel subalgebra. 

Let N be the number of positive roots and let 

O^ne A N n+. 

Clearly 

yn — Vy € n + . 

Hence by (9.26) we have 

<f>(n + )n = 

while by (9.25) 

$(h)n = p{h)n V/ieh. 
This implies that the cyclic module 

&(U(g))n 

is a model for the irreducible representation V p of g with highest weight p. Left 
multiplication by $(x), x € g gives the action of g on this module. 

Furthermore, if nc ^ for some c g C(g) then nc has the same property: 

$(n + )nc = 0, <&{h)nc — p(h)nc, Vfceh. 

Thus every nc ^ also generates a g module isomorphic to V p . 
Now the map 

An + <g> Ab_ — > C(g), x<g>b ^ xb 

is a linear isomorphism and right Clifford multiplication of A Ar n + by An + is 
just A^n+j all the elements of of A + n + yielding 0. So we have the vector space 
isomorphism 

nC(g) = A w n + ® Ab_. 



In other words, 



$(C/(g))nC(g) 
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is a direct sum of irreducible modules all isomorphic to V p with multiplicity 
equal to 

dim Ab_ = 2 S+N 

where s = dimh and N = dimn_ = dimn + . Let us compute the dimension 
of V p using the Weyl dimension formula which asserts that for any irreducible 
finite dimensional representation V\ with highest weight A we have 

dim V\ = 



n^,e* + (Pi <t>) 

If we plug in A = p we see that each factor in the numerator is twice the 
corresponding factor in the denominator so 

AimV p = 2 N . (9.27) 

But then 

dim$([/(g))nC(g) = 2 S+2N = dimC(g). 

This implies that 

C(g) = $(f/(g))nC(g) = $(tf(g))n(Ab_), (9.28) 

proving that C(g) is primary of type V p with multiplicity 2 S+N as a represen- 
tation of g under the left multiplication action of $(g). 

This implies that any submodule for this action, in particular any left ideal 
of C(g), is primary of type V p . Since we have realized the spin representation 
of C(g) as a left ideal in C(g) we have proved the important 

Theorem 17 Spin ad is primary of type V p . 

One consequence of this theorem is the following: 
Proposition 29 The weights ofV p are 

P-^j (9.29) 
where J ranges over subsets of the positive roots and 



4>eJ 



each occurring with multiplicity equal to the number of subsets J yielding the 
same value of <pj. 

Indeed, (9.21) gives the weights of Spin ad, but several of the (3j are equal 
due to the trivial action of ad(h) on itself. However this contribution to the 
multiplicity of each weight occurring in (9.21) is the same, and hence is equal 
to the multiplicity of V p in Spin ad. So each weight vector of V p must be of the 
form (9.29) each occurring with the multiplicity given in the proposition. 



Chapter 10 

The Kostant Dirac operator 



Let p be a vector space with a non-degenerate symmetric bilinear form. We 
have the Clifford algebra C(p) and the identification of o(p) = A 2 (p) inside 
C(p). 

10.1 Antisymmetric trilinear forms. 

Let 4> be an antisymmetric trilinear form on p. Then <j> defines an antisymmetric 
map 

& = &0:p<gip^p 

by the formula 

(b(y,y'),y") = cP(y,y',y") V y, y', y" € p. 
This bilinear map "leaves ( , ) invariant" in the sense that 

(b(y,y'),y") = (y,b(y',y"))- 

Conversely, any antisymmetric map b : p <£> p — > p satisfying this condition 
defines an antisymmetric form (/>. Finally either of these two objects defines an 
element v € A 3 p by 

-2(v, y Ay' A y") = (b(y, y'),y") = cj>(y, y', y"). (10.1) 

We can write this relation in several alternative ways: Since 

-2(v, y Ay' A y") = -2(t(y')L(y)v, y") = 2(t(y)i(y')v, y") 

we have 

b(y,y') = 2L(y)i(y')v. (10.2) 

Also, i(y)v € A 2 p and so is identified with an element of o(p) by commutator 
in the Cliford algebra: 

ad(L(y)v)(y') = [i(y)v,y'] = -2i(y')L(y)v 
163 



164 



CHAPTER 10. THE KOSTANT DIRAC OPERATOR 



so 

a d(i(y)v)(y') - [t(y)v, y'} - b(y, y'). (10.3) 

10.2 Jacobi and Clifford. 

Given an antisymmetric bilinear map b : p <g) p — > p we may define 

Jac(6) : p ® p ® p — > p 

by 

Jac(6)(y, y', y") = b(b(y, y'),y") + b(b(y', y"),y) + b(b(y", y),y') 

so that the vanishing of Jac(6) is the usual Jacobi identity. It is easy to check 
that Jac(6) is antisymmetric and that if b satisfies (b(y,y'),y") = (y,b(y' ,y")) 
then the four form 

y,y',y",y"' ~(j a c(b)(y,y',y"),y"') 
is antisymmetric. We claim that if v £ A 3 p as in the preceding subsection, then 

L(y")L(y')L( y y = ±J<ic(b)(y,y', y "). (10.4) 
To prove this observe that 

>-{y)v 2 = (i(y)v)v - v(i(y)v) 
i(y')L(y)v 2 = (i(y')L(y)v)v + (i( y )v){i{y')v) - (i(y')v){b{y)v) + v(i(y')b(y)v) 
i{y")i{y')i{y)v 2 = -{i{y')i{y)v)i{y")v + {i{y")v){i{y')i{y)v) + {i{y")i{y)v)i{y')v 

+ {i{y)v){i{y")i{y')v {i{y")i{y')v){i{y)v) - {i{y')v){i{y")i{y)v)) 
= [t{y")v, i(y'Hy)v] + [t(y')v, i{yW)v] + [ t (y)v, i^W)v] 

= l -Uc{b){y,y' ,y") 

by (10.2) and (10.3). 

Equation (10.4) describes the degree four component of v 2 in terms of Jac(6). 
We can be explicit about the degree zero component of v 2 . We claim that 



1 " 

= 2i tr ^^^ e AviXyo,y)% <j- (10-5) 

Indeed, by (9.6) we know that (v 2 )o = —(v, v) and since yi A y 3 A yk, i < j < k 
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form an "orthonormal" basis of A 3 p we have 

~{v,v) = - ^2 ±(v,y i Ay j Ay k ) 2 ,± = e i e j e k 

l<i<j<k<n 
_^ n,n,n 

= -g M v ,Vi A Vj A Vk) 2 

»=i,j=i,fc=i 

_^ n,n,n 

= -g 51 iMa/kMs/jKy*) 2 

i=l,j=l,fc=l 
_^ n.n,n 

= ~24 H ±(&(lfe.y*)>w) 2 
»=i,j=i,fc=i 

= H ejtk{b{yj,yk),b{yj,yk)) 

j=i,fc=i 

= ^4 ^k{b{yj,b{yj,y k )),y k ) 
j=i,fc=i 

proving (10.5). 

10.3 Orthogonal extension of a Lie algebra. 

Let us get back to the general case of a Lie algebra r acting as infinitesimal 
orthogonal transformations on p and the map v : r — > A 2 p given by (9.13). 
Suppose that the Lie algebra r has a non-degenerate invariant symmetric bilinear 
form ( , ) r . We have the transpose map 

: A 2 p — > r 

since both r and A 2 p have non-degenerate symmetric bilinear forms. For y and 
y' in p, let us define 

[y,y'] T :=-2v\yAy'),. 
This map is an r morphism which says that 

[x, [y, y']r] = [x-y, y% + [y,x- y'] r , (10.6) 
where the bracket on the left denotes the Lie bracket on r. Also, we have 

(x,[y>y']r)r = -2{x,v\y Ay')) r 
= -2{u{x),yAy') p 
= -2(i(y)v(x),y') p 

= (x-y,y') P . 



166 



CHAPTER 10. THE KOSTANT DIRAC OPERATOR 



So we have proved 

(x,[y,y'] r ) r = (x-y,y') p . (10.7) 

This has the following significance: Suppose that we want to make r © p into a 
Lie algebra with an invariant symmetric bilinear form ( , ) such that 

• r and p are orthogonal under ( , ), 

• the restriction of ( , ) to r is ( , ) r and the restriction of ( , ) to p is ( , ) p , 
and 

• [r, p] C p and the bracket of an element of r with an element of p is given 
by [x, y] = x-y. 

Then 

the r component of [y, y'\ must be given by [y, y'] T . 

Thus to define a Lie algebra structure on r p we must specify the p 
component of the bracket of two elements of p. This amounts to specifying 
awe A 3 p as we have seen, and the condition that the Jacobi identity hold 
for x, y, y' with x e r and y, y' <G p amounts to the condition that v € A 3 p be 
invariant under the action of r. It then follows that if we try to define [,] = [,]« 

by 

[y,y'] = [y,y']r + HyW)v 

then 

([z,z'],z") = (z,[z',z"}) 

for any three elements of g := r © p, and the Jacobi identity is satisfied if at 
least one of these elements belongs to r. Furthermore, for any x£rwe have 

([[y,y'],y"],x) = ([[y,y'],y"],x) r 

= ([y,y'],[y",x]) p by (10.7) 

= ([x,[y,y%y'X 
= {[[x,y],y'],y") P + {[y, [x,y']],y") p 
= ([x,y],[y',y"}) P + ([x,y'},[y",y}) P 
= (x,[y,[y',y"]) r + (x,[y',[y",y}}) r 

or 

([[y,y'],y"] + [[y',y"],y] + [[y",y],y'],x) = o. 

In other words, the r component of the Jacobi identity holds for three elements 
of p. 

So what remains to be checked is the p component of the Jacobi identity for 
three elements of p. This is the sum 



Jac(6)(y, y 1 , y") + [y, y% ■ y" + [y', y'% ■ y + [y", y] r ■ y'. 
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Let us choose an "orthonormal" basis {xi}, i = 1, . . . , r of r and write 



SO 



[y, y% -y" = ^2 Ci ^ y ' y '^ r ' x ^ Xi ' v " ■ 



Then by (9.11) and (10.4) we see that the Jacobi identity is 
i{yWW) (v 2 + v(Cas r )) - 

where 

Cas r := Y^tiX 2 e U(r) 

i 

does not depend on the choice of basis, and v : U(r) — > C(p) is the extension of 
the homomorphism v : r — > C(p). In particular, we have proved that v defines 
an extension of the Lie algebra structure satisfying our condition if and only if 

u 2 + i/(Cas r )eC (10.8) 

i.e. has no component of degree four. 

Suppose that this condition holds. We then have defined a Lie algebra 
structure on 

g = r ® p. 

We let P r and P p denote projections onto the first and second components of 
our decomposition. Our Lie bracket on g, denoted simply by [ , ] satisfies 

[x,x'\ = [x,x'] T , x,x' er (10.9) 

[x,y] = x-y, xer,yep (10.10) 

P T [y,y'] - [y,y'] r = -2^(yAy') y,y'e P (10.11) 

P P [y,y'} = b(y,y') = 2t(y)i(y , )v, y,y'e P . (10.12) 

From now on we will assume that we are over the complex numbers or that 
we are over the reals and the symmetric bilinear forms are positive definite. 
This is not for any mathematical reasons but because the formulas become a 
bit complicated if we put in all the signs. We leave the general case to the 
reader. 



10.4 The value of [v 2 + z/(Cas r )] . 

Condition (10.8) says that the degree four component of v 2 + v(Cas T ) vanishes. 
Assume that this holds, so we have constructed a Lie algebra. We will now 
compute the degree zero component of v 2 + ^(Cas r ). The answer will be given 
in equation (10.13) below. 
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We can write (10.5) as 

1 " 

= 24 tr Pp ^ ad s(yj) p p ad g(%) p p 



1 " 

- tr P p ^ ad g ( yj ) P p ad g (y 3 ) 



24 

in view of (10.12) where ad g denotes the adjoint action on all of g. On the other 
hand, we have from (9.12) that 

1 1 r " 

z/(Cas r ) = g tr^(ad^) 2 P p = - ^ ^([x i7 [a:,, % ]], %)• 

We can rewrite this sum as 

_^ r,n 

g 51 ([^'fjitl'j'^]) 
»=i,j=i 

which equals | X)i=i%i fc=i([ x ^ Vj],yk)(yk, [yj,Xi])- But this equals 



^ r,n,n 

*=i,i=i,fe=i 

= ^ XI (^r[%-,yfe],Pr[yfe,%]) 
J=l,fe=l 

= g J! (■ P r[%-,J/fe],[j/fc,J/j]) 
j'=l,fe=l 

= g ([yj' p r[yj,yk]],yk) 
i=i,fe=i 

i " 

= g tr P p X ad g ) p r ad g ( % ) P p . 

In other words 

^ n _^ n 

KCas r ) = g tr P p ^ adg(yj)P r ad g (^)P p = - tr ^ ad g ( % )P r ad g ( % )P p . 

i=i i=i 

Multiplying this equation by 1/3 and adding it to the above expression for (f 2 )o 
gives 

1 1 " 

-i/(Cas r ) + (v 2 ) = ^tr^(adg % ) 2 P p . 
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We can write 

v(Cas r ) = - d^' vMVh x i\) = g XI 

Z= l.j — 1 « — — 1 

_^ n 1 n 

= g tr Pr ]T(ad g % ) 2 P r = - tr ^T(ad g Vj ) 2 P r . 

3=1 J=l 

Multiplying by 1 /3 and adding to the preceding equation gives 

2 1 " 

-^(Cas r ) + 2 )o = ^tr^(ad g% ) 2 . 

On the other hand 

1 r '" 1 
f(Cas r ) = - X {[x it [xi, yj]],Vj) = -trad p (Cas r ) 

i=l,j=l 

= ^ (tradg(Cas r ) — tr ad r (Cas r )) . 
8 

Multiplying by 1/3 and adding to the preceding equation, and using the fact 
that Casg = Cas r + J2v] gives 

^(Cas r ) + v 2 = ^ (tr adg(CaSg) - tr ad r (Cas r )) (10.13) 
when (10.8) holds. 

Suppose now that the Lie algebra r is reductive and that the Lie algebra g 
we created out of r and p using a v € A 3 p satisfying (10.8) is also reductive. 
Using (7.29) for g and for r in the right hand side of (10.13) yields 

KCas r ) + v 2 = ((p g , Ps ) - (p r , p r )) . (10.14) 



10.5 Kostant's Dirac Operator. 

Suppose that we have constructed our Lie algebra g = r + p from a v e A 3 p 
satisfying (10.8). We are going to define 

£ g u( s ) ® c( P ) 

as follows: Let yi, . . . , y n be an orthonormal basis of p. Then 

J(:=^(/,®!/, + l®». (10.15) 

i 

(On the left of the tensor product sign the yi £ p is considered as an element 
of C/(g) via the canonical injection of p in U (p) C U (g) and on the right of the 
tensor product sign it lies in C(p) via the canonical injection of p into C(p).) 
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Wc have a homomorphism U (r) — > C/(g), in particular a Lie algebra injection 
r — ► [/(g). We also have a Lie algebra homomorphism v : r — > C(p). In 
particular, we have the diagonal Lie algebra map 

diag : r — > 17(g) ® C(p) , diag(x) = a; ® 1 + 1 ® f(a;) 

and this extends to an algebra map 

diag :U{v)^ [7(g) ®C(p). 

For example, 

diag(Cas r ) = ^ (a;j ® 1 + 1 ® ^(xi)) 
where orthonormal basis of r. In other words 

r r r 

diag(Cas r ) = ^ a: 2 ® 1 + 2 ^ ® i/fo) + ^ 1 ® v{x t ) 2 . (10.16) 

i— 1 i—1 i— 1 

Wc claim that 

JC 2 = Cas g (g)l - diag(Cas r ) + ^ (trad g (Cas g ) - tr ad r (Cas r )) 1 ® 1. (10.17) 
To prove this, let us write (10.15) as 

So 

(JC") 2 = 1® V 2 

and hence 

r 1 
(JC") 2 + Y 1 = ™ ( tr ad g( Cas g) - tr ad r (Cas r )) 1 ® 1 

by (10.13). We have 

(^') 2 = Y yiy i® yiy i 

= Y y i ® 1 + Y ViVi ® 

= Y Vi ® 1 + Y( yiy j ~ yjVi) 



i<j 



Y y i 01 1 ~~ 2 5Z v ^ Vt A ® ^ A % + 2 X! L (yi) L (Vo) v ®Vt A Vh 



i<j i<j 
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where we have used the decomposition of [yi, yj] into its r and p components to 
get to the last expression. We can write the middle term in the last expression 
as 

r 

- 2 ^2 u Hvi Ayj) <g>2ft Ayj = -2^^(i/ t (y i Ay :) ),x fe )x fc (X)?/iAy :) - 

i<j fc=l i<j 

r 

= -2 y~] y^xvi a yj^( x k))xk ® y% a yj 

fc=l i<j 
r 

= -^y^y^xk ® fa A yj,v{x k ))yi Ayj 

fc=l i<j 

= -2^^® y{xj). 

i 

Since ^xf<E>l + ^y|<E>l = Cas g 01 we conclude that 

(JC') 2 + (r) 2 + diag(Cas r ) 

= CaSg ®1 + ^ (tr adg(CaSg) — tr ad r (Cas r )) 1 ® 1 + 2 ^ i(yi)i{yj)v®yif\yj. 

i<j 

To complete the proof of (10.17) we must show that 

+ = L( yi )L( yj )v ® Vl A y 3 . 

i<j 

But 

= ^yy^yj®^{yj)v,yiAy k )yiAy k 

i<k j 

= YY y i®^ 2v ' VtAykhy ^ yihyk 

i<k j 

= y 3 ® ( 2i (yk)i<(yi)v, yj)yi a yj 

i<k j 

= Y X^ 2 ^)^)^ y o) y o ® y * A y j 

i<k j 

= y2 2 ^ yk ^ L ^ v(g>yiAyk ' 

i<k 

completing the proof. 

In the case where r and g are reductive we have the alternative formula 

JC 2 = Casg 01 - diag(Cas r ) + ((p s , p g ) - (p T , p r )) 1 <g> 1. (10.18) 
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Suppose that A is the highest weight of a finite dimensional irreducible rep- 
resentation V\ of g so that we get a surjective homomorphism 

U(g) - End(y A ) 

and hence a corresponding homomorphism 

U(g)®C(p) ^End(F A )®C(p). 

Also let diag A denote the composition of this homomorphism with 

diag:(7(r)^U(g)®C(p). 

Then from the value of the Casimir (7.8) and (10.18) we get 

= ((A + p e , A + pg) - {p r , Pr)) 10 1- diag A (Cas r ). (10.19) 

10.6 Eigenvalues of the Dirac operator. 

We consider the situation where g = r © p is a Lie algebra with invariant 
symmetric bilinear form, where r has the same rank as g, and where we have 
chosen a common Cartan subalgebra 

h C r C g. 

We let I denote the dimension of h, i.e. the common rank of r and g. We let 
$ = denote the set of roots of g, let W = W s denote the Weyl group of g, 
and let W r denote the Weyl group of r so that 

W r C W 

and we let c denote the index of W T in W. 

A choice of positive roots <I> + for g amounts to a choice of a Borel subalgebra 
b of g and then b n r is a Borel subalgebra of r, which picks out a system of 
positive roots for r and then 

$+ C 

The corresponding Weyl chambers are 

D = D g = {\eh^\(\,<f))>0 V^G$+} 

and 

A- = {A e h^|(A» >0 V0e$+} 

so 

D C D r 

and we have chosen a cross-section C of W r in W as 



C = {we W\wDcD r } 7 
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so 

W = W r ■ C, D r = |J wD. 

wee 

We let L = L g C hj^ denote the lattice of g integral linear forms on h, i.e. 

L = { M eh*|2^4veZV0eA}. 
(<?,<?) 



We let 

and 

We set 
and 



2 



= 5 S ' 



* 2 



the lattice spanned by L and p r 



A :=LnD, A r := L r fl D r . 

For any r module Z we let r(Z) denote its set of weights, and we shall 
assume that 

T(Z) C L r . 

For such a representation define 

m z := max (7 + p r ,j + p r ). (10.20) 

For any p e A r we let denote the irreducible module with highest weight p. 
Proposition 30 Let 

r max (Z) := {p g T(Z)\(p + p r ,p + p r ) =m z }. 
Let fi g r max (Z). Tften 

L yU g A r . 

2. If z ^0 is a weight vector with weight fi then z is a highest weight vector, 
and hence the submodule U(r)z is irreducible and equivalent to Z^. 

3. Let 

per max (z) 

and 

Y := U(r)Y max . 

TTien mz — (p T ,p r ) is the maximal eigenvalue o/Cas r on Z and Y is the 
corresponding eigenspace. 
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Proof. We first show that 

M € Tmax => M + Pr e A r . 

Suppose not, so there exists aw/1, w G W T such that 

wp + wp r e A r . 

But w changes the sign of some of the positive roots (the number of such changes 
being equal the length of w in terms of the generating reflections), and so p r ~wp r 
is a non-trivial sum of positive roots. Therefore 

(wp + Wp r , p r — Wp r ) > 0, (p r — Wp r , p r — Wp r ) > 

and 

Wp + p r = (Wfl + Wp r ) + (p r - Wp r ) 

satisfies 

(wp + Pr, Wp + pr) > {wp, + Wpr, Wp + W p r ) = (p + p r , p + p) = m Z 

contradicting the definition of mz- Now suppose that z is a weight vector 
with weight p which is not a highest weight vector. Then there will be some 
irreducible component of Z containing z and having some weight p! such that 
p! — p is a non trivial sum of positive roots. We have 

p' + Pr = (// - p) + (p + Pr) 
so by the same argument we conclude that 

(fjf + p r ,// + p r ) > m z 

since p + p r € A r , and again this is impossible. Hence z is a highest weight 
vector implying that p G A r . This proves 1) and 2). 

We have already verified that the eigenvalue of the Casimir Cas r on any Z 1 
is (7 + pr, 7 + Pr) - (p r , Pr) ■ This proves 3). 

Consider the irreducible representation V p of g corresponding to p = p % . By 
the same arguments, any weight 7 ^ p of V p lying in D must satisfy (7, 7) < 
(p, p) and hence any weight 7 of V p satisfying (7, 7) = (p, p) must be of the form 

7 = wp 

for a unique w <G W. But 

4>£Ju, 

where 

J w := w(-$+)n$+. 
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We know that all the weights of V p are of the form p — <j>j as J ranges over all 
subsets of <I> + . So 

(p,p)>(p-<j>j,p-cj>j) (10.21) 

where we have strict inequality unless J = J w for some w e W. 

Now let A € A, let V\ be the corresponding irreducible module with highest 
weight A and let 7 be a weight of V\. As usual, let J denote a subset of the 
positive roots, J C <f> + . We claim that 

Proposition 31 We have 

(A + p,A + p) > fr + p-fa-y + p-fa) (10.22) 

with strict inequality unless there exists aw^W such that 

7 = wX, and J = J w 
in which case the w is unique. 
Proof. Choose w such that 

u> _1 (7 + p- 4>j) e A. 

Since w^ 1 ^) is a weight of V\, X — w^ 1 ^) is a sum (possibly empty) of positive 
roots. Also w~ 1 (p — 4>j) is a weight of V p and hence p — w~ 1 (p — <pj) is a sum 
(possibly empty) of positive roots. Since 

A + p = (A - w~ 1 'y) + (p - iw _1 (p - 0j) + w _1 (7 + p - </>j)), 

we conclude that 

(A + p, A + p) > (w" 1 (7 + p -<f>j), w _1 (7 + p -<t>j)) = (7 + p- 0,7,7 + p-0j) 

with strict inequality unless A — w^ 1 ^) = = p — w _1 (p — (f>j), and this last 
equality implies that J — J w . QED 

We have the spin representation Spin v where v : r — > C(p). Call this 
module 5. Consider 

V X ®S 

as a r module. Then, letting 7 denote a weight of V\, we have 

r(V A ®S) = {p = 7 + Pp-<M (10-23) 

where 

Je<s>+ 

In other words, <& p are the roots of g which are not roots of r, or, put another 
way, they are the weights of p considered as a r module. (Our equal rank 
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assumption says that does not occur as one of these weights.) For the weights 
liofV\®S the form (10.23) gives 

P + Pr = 1 + P- 4>.J, JCA+. 

So if we set Z = V\ <£> S as a r module, (10.22) says that 

(A + p, A + p) > m z . 

But we may take J = as one of our weights showing that 

m z = {X + p s ,X + p g ). (10.24) 

To determine T max (Z) as in Prop. 30 we again use Prop. 31 and (10.23): A 
p = 7 + p p — 4>j belongs to r max (Z) if and only if 7 = wX and J = J w . But 
then 

Since p g = p r + p p we see from the form (10.23) that 

p + p r = w(X + p g ) (10.25) 

where w is unique, and 

J w C *+. 

We claim that this condition is the same as the condition w{D) C D r defining 
our cross-section, C. Indeed, weCif and only if {<p, wp s ) > 0, V G But 
(<f>,wp) = (io _1 ^,p) > if and only if e w;(<f> + ). Since J w = w(-<f>+) n <f> + , 
we see that J w C is equivalent to the condition w € C. 
Now for p <E r max (iT) we have 

p = w(X + p) - p r =: w A (10.26) 

where 7 = w(A) and so has multiplicity one in V\. 

Furthermore, we claim that the weight p p — has multiplicity one in S. 
Indeed, consider the representation 

Z Pr ®S 

of r. It has the weight p = p r + p p as a highest weight, and in fact, all of 
the weights of V p% occur among its weights. Hence, on dimensional grounds, 
say from the Weyl character formula, we conclude that it coincides, as a rep- 
resentation of r, with the restriction of the representation V Ps to r. But since 
Pg ~ 4>J m = w Pg nas multiplicity one in V Pg , we conclude that p p — 4>j w has 
multiplicity one in S. 

We have proved that each of the w • A have multiplicity one in V\ (g> S with 
corresponding weight vectors 

z w .\ := v w \ ® ei w e + . 
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So each of the submodules 

Z w . x := U{v)z w . x (10.27) 

occurs with multiplicity one in V\ <£> S. The length of w e C (in terms of 
the simple reflections of W determined by A) is the number of positive roots 
changed into negative roots, i.e. the cardinality of J w . This cardinality is the 
sign of det w and also determines whether e{_ e + belongs to S + or to 5_ . From 
Prop. 31 and equation (10.24) we know that the maximum eigenvalue of Cas r 
on V\ <g> S is 

(A + Pg,A + /? g ) - (p T ,p r ). 

Now JC A <G End(Vx ® S) commutes with the action of r with 

V x ® S+ -» y A ® S_ 

Furthermore, by (10.19), the kernel of JC A is the cigenspace of Cas r correspond- 
ing to the eigenvalue (A + p, A + p) — (p r ,p r ). Thus 

Kcr(^) = J2 Z w . x . 

wee 

Each of these modules lies either in V <£> 5+ or V <E) S- , one or the other but not 
both. Hence 

Kcr(^)=Kcr(JC A ) 

and so 

K ™Wx)\v X0S+ = E z ^ ( 10 - 28 ) 

w£C. dctw— 1 

and _ 

Kcr (^)U« s _ = E ^ ( 10 - 29 ) 

idGC, det w — — 1 

Let 

#± := ( 10 ' 3 °) 

tuGC, detw=±l 

It follows from (10.28) that JC A induces an injection of 

(V x ® S+)/K+ -^V®S- 
which we can follow by the projection 

V X ®S_^ {Vx®S_)/K_. 
Hence JC A induces a bijection 

tx ■ {V®S+)/K+ -» {Vx®S-)/K_. (10.31) 
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In short, we have proved that the sequence 

— > K + —*V\® S + —*V\(B) S- — > K_ — > (10.32) 

is exact in a very precise sense, where the middle map is the Kostant Dirac 
operator: each summand of K + occurs exactly once in V\ <g> S+ and similarly for 
K_. This gives a much more precise statement of Theorem 16 and a completely 
different proof. 

10.7 The geometric index theorem. 

Let r be the representation of G on the space !F(G) of smooth or on L 2 (G) of 
L 2 functions on G coming from right multiplication. Thus 

[r(g)f](a) = f(ag). 

Then JC acts on T{G) ® S or on L 2 (G) (g> S* and centralizes the action of diagr. 
If U is a module for i?, we may consider T{G) ® S ® U or L 2 (G) ®5®(7, 
and JC <g> 1 commutes with diagr £g> 1 and with the action p of i? on [7, i.e with 
1 ® 1 ® p. If i? is connected, this implies that commutes with the diagonal 
action of R, the universal cover of R, on T <£> 5 ® 17 or L 2 (G) <g> S 1 ® {/ given by 

fc r(fc) ® Spin(fc) ® fc e -R 

where Spin : i? — » Spin(p) is the group homomorphism corresponding to the 
Lie algebra homomorphism v. If G/R is a spin manifold, the invariants under 
this R action correspond to smooth or L 2 sections of S ® U where S is the 
spin bundle of G/R and hi is the vector bundle on G/R corresponding to U. 
Thus JC descends (by restriction) to a differential operator ^ on G/R and we 
shall compute its G-index for irreducible U. The key result, due to Landweber, 
asserts that if U belongs to a multiplct coming from an irreducible V of G, then 
this index is, up to a sign, equal to V . If U does not belong to a multiplct, then 
this index is zero. We begin with some preliminary results due to Bott. 

10.7.1 The index of equivariant Fredholm maps. 

Let E and F be Hilbert spaces which are unitary modules for the compact Lie 
group G. Suppose that 

n n 

are completed direct sum decompositions into subspaces which are G-invariant 
and finite dimensional, and that 



T :E^ F 
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is a Fredholm map (finite dimensional kernel and cokernel) such that 

T{E n ) C F n . 

We write 

Indcx G T = Kcr T - Cokcr T 

as an element of R(G), the ring of virtual representations of G. Thus R(G) 
is the space of finite linear combinations a\V\, a\ £ Z as V\ ranges over 
the irreducible representations of G. (Here, and in what follows, we are regard- 
ing any finite dimensional representation of G as an element of R(G) by its 
decomposition into irreducibles, and similarly the difference of any two finite 
dimensional representations is an element of R(G).) 
If we denote the restriction of T to E n by T n , then 

Indcx G T = Y^ Index G T n 

where all but a finite number of terms on the right vanish. For each n we have 
the exact sequence 

-» Kcr T n ^E n ^ F n -> Cokcr T n -» 0. 

Thus 

Indcx G T n = E n - F n 
as elements of R(G). Therefore we can write 

rndcx G T = ]r(£ n -F„) 

in R(G), where all but a finite number of terms on the right vanish 
refer to this as Bott's equation. 



(10.33) 
. We shall 



10.7.2 Induced representations and Bott's theorem. 

Let R be a closed subgroup of G. Given any i?-action p on a vector space U, 
we consider the associated vector bundle G x r V over the homogeneous space 
G/R. The sections of this bundle are then equi variant U- valued functions on G 
satisfying s(gk) — p(k)' 1 s(g) for all k € R. Applying the Peter-Weyl theorem, 
we can decompose the space of L 2 maps from G to U into a sum over the 
irreducible representations V\ of G, 

l 2 (g) a f a ® V? ® U, 

with respect to the G x G x R action I ® r ® p. The i?-equivariance condition 
is equivalent to requiring that the functions be invariant under the diagonal 
i?-action k \— > r(fc) ® p(fc). Restricting the Peter-Weyl decomposition above to 
the i? invariant subspace, we obtain 



R 

(10.34) 
® x V x ®ttom R (V x ,U). 



L 2 (Gx R U) =® X V X ®(VZ®U) 
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The Lie group G acts on the space of sections by 1(g), the left action of G on 
functions, which is preserved by this construction. The space L 2 (G x# U) is 
thus an infinite dimensional representation of G. 

The intertwining number of two representations gives us an inner product 

(V, W) G = dime Hom G (y, W) 

on R(G), with respect to which the irreducible representations of G form an 
orthonormal basis. Taking the formal completion of R(G), we define R(G) to 
be the space of possibly infinite formal sums ^2 x a\V\. The intertwining number 
then extends to a pairing R(G) x R(G) — > Z. 

If R is a subgroup of G, every representation of G automatically restricts 
to a representation of R. This gives us a pullback map i* : R(G) — > R(i?), 
corresponding to the inclusion i : R ^> G. The map £/ L 2 (G x H U) dis- 
cussed above assigns to each ^-representation an induced infinite dimensional 
G-representation. Expressed in terms of our representation ring notation, this 
induction map becomes the homomorphism i* : R(R) — > R(G) given by 

A 

the formal adjoint to the pullback i*. This is the content of the Frobenius 
reciprocity theorem. 

A homogeneous differential operator on G/R is a differential operator D : 
T(£) —* T(f) between two homogeneous vector bundles £ and T that commutes 
with the left action of G on sections. If the operator is elliptic, then its kernel and 
cokernel are both finite dimensional representations of G, and thus its G-index 
is a virtual representation in R(G). In this case, the index takes a particularly 
elegant form. 

Theorem 18 (Bott) If D : T(G x H U ) — ► T(G x H Ui) is an elliptic homoge- 
neous differential operator, then the G-equivariant index of D is given by 

Index G D = i^(U - U\), 

where i«(Uo — U\) is a finite element in R(G), i.e. belongs to R(G). 

In particular, note that the index of a homogeneous differential operator 
depends only on the vector bundles involved and not on the operator itself! To 
prove the theorem, just use Bott's formula (10.33), where the subscript n is 
replaced by A labeling the the G-irreducibles. 

10.7.3 Landweber's index theorem. 

Suppose that G is semi-simple and simply connected and R is a reductive sub- 
group of maximal rank. Suppose further that G/R is a spin manifold, then we 
can compose the spin representation S = S + ® S- of Spin(p) with the lifted 
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map Spin : R — > Spin(p) to obtain a homogeneous vector bundle, the spin bun- 
dle S over G/R. For any representation of R on U the Kostant Dirac operator 
descends to an operator 

9 V :T(S ± (g>U) -^T(S T ®U). 

(This operator has the same symbol as the Dirac operator arising from the 
Levi-Civita connection on G/R twisted by U, and has the same index by Bott's 
theorem. For the precise relation between this Dirac operator coming from 
K and the dirac operator coming from the Levi- civita connection we refer to 
Landweber's thesis.) 

The following theorem of Landweber gives an expression for the index of this 
Kostant Dirac operator. In particular, if we consider G/T, where T is a maximal 
torus (which is always a spin manifold), this theorem becomes a version of the 
Borcl-Wcil-Bott theorem expressed in terms of spinors and the Dirac operator, 
instead of in its customary form involving holomorphic sections and Dolbcault 
cohomology. 

Theorem 19 (Landweber) Let G/R be a spin manifold, and let be an 
irreducible representation U p of R with highest weight fi. The G-equivariant 
index of the Dirac operator ^ v is the virtual G -representation 

Index G ^ = (-l) dim p/ 2 (-l) w V Mp+PH) _ PG (10.35) 

if there exists an element w e Wq in the Weyl group of G such that the weight 
w(fi + p H ) — pa is dominant for G. If no such w exists, then Index^ ft v = 0. 

Proof. For any irreducible representation V\ of G with highest weight A we 
have 

V x ®(S + -S-)= J2(- l ) Wu ^x 

w£C 

by [GKRS] . Hence 

Rom R (V x ®(S + -S_),U,) = 
if fi ^ w • A for some ideC while 

Hom^(T/ A ® (S+ - S-),U,) = (-l) w 

if = w • A. But, by (10.33) and Theorem 18 we have 

Indcx G ^ - Qv x ®(VZ®(S + -S_)®U p ) R 

= 0Hom fl (Vx®(S + -S_)*,l7 M ). 

Now (S*+ - = S+ - S_ if dimp = mod(4) while (S+ - S_)* = S_ - S + 
if dimp = 2 mod(4). Hence 

Index G ^ = (-l)di mP /20 Homfl(FA {s+ _ S _),U,). (10.36) 
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The right hand side of (10.36) vanishes if p does not belong to a multiplet, i.e 
is not of the form 

w • A = w(X + p g ) — p r 
for some A. The condition w • A = p can thus be written as 

W~ 1 (p + p r ) - p g = A. 

If this equation does hold, then we get the formula in the theorem (with w 
replaced by w^ 1 which has the same determinant). QED 

In general, if G/R is not a spin manifold, then in order to obtain a similar 
result we must instead consider the operator 

^ : (L 2 (G) ® (S ± ) ® -> (L 2 (G) ® ® 

viewed as an operator on G, restricted to the space of (S® C/ /i )-valued functions 
on G that are invariant under the diagonal r-action g(Z) = diag(Z) + <r(Z), 
where a is the r-action on U^. Note that if S^iU^ is induced by a representation 
of the Lie group R, then this operator descends to a well-defined operator on 
G/R as before. In general, the G-cquivariant index of this operator is once 
again given by (10.35). To prove this, we note that Bott's identity (10.33) and 
his theorem continue to hold for the induction map : R(r) — ■> R(g) using the 
representation rings for the Lie algebras instead of the Lie groups. Working in 
the Lie algebra context, we no longer need concern ourselves with the topological 
obstructions occurring in the global Lie group picture. The rest of the proof of 
Theorem 19 continues unchanged. 



Chapter 11 

The center of U(g). 



The purpose of this chapter is to study the center of the universal enveloping 
algebra of a semi-simple Lie algebra g. We have already made use of the (second 
order) Casimir element. 

11.1 The Harish- Chandra isomorphism. 

Let us return to the situation and notation of Section 7.3. We have the monomial 
basis 

f *i . . . f *m h h . . . U 31 pfcl . . . pk m 
Jl Jm 'n n e e l e m 

of U(g), the decomposition 

C/(g) = U(h) (C/(g)n+ + n_*7(g)) 

and the projection 

7 : C/(g) - (7(h) 

onto the first factor of this decomposition. This projection restricts to a projec- 
tion, also denoted by 7 

7 : Z(g) -> C/(h). 

The projection 7 : Z(g) — > t/(h) is a bit awkward. However Harish-Chandra 
showed that by making a slight modification in 7 we get an isomorphism of 
Z(g) onto the ring of Weyl group invariants of U(h) = 5(h). Harish-Chandra's 
modification is as follows: As usual, let 

Recall that for each i, the reflection Sj sends on ^ —on and permutes the 
remaining positive roots. Hence 

SiP = Q - OSi 
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But by definition, 

SiP = p- (p,ai)ai 

and so 

(P, <*») = 1 

for all i — 1, . . . , m. So 

P = uji j i \-uJ m , 

i.e. p is the sum of the fundamental weights. 

11.1.1 Statement 

Define 

a:h^U(h), a{h) = h - p(h)l. (11.1) 

This is a linear map from h to the commutative algebra [/(h) and hence, by 
the universal property of [/(h), extends to an algebra homomorphism of U(h) 
to itself which is clearly an automorphism. We will continue to denote this 
automorphism by a. Set 

7h := a o 7. 

Then Harish-Chandra's theorem asserts that 

1H : Z(g) [/(h) w 
and is an isomorphism of algebras. 

11.1.2 Example of sl(2). 

To see what is going on let's look at this simplest case. The Casimir of degree 
two is 

l -h 2 + ef + fe, 

as can be seen from the definition. Or it can be checked directly that this 
element is in the center. It is not written in our standard form which requires 
that the / be on the left. But ef = fe + [e, /] = fe + h. So the way of writing 
this element in terms of the above basis is 

±h 2 + h + 2fe, 

and applying 7 to it yields 

There is only one positive root and its value on h is 2, so p(h) = 1. Thus a 
sends |/i 2 + h into 

\{h-\? + h-\ = \h 2 -h+\ + h-\ = \{h 2 -\). 

The Weyl group in this case is just the identity together with the reflection 
h 1 > —h, and the expression on the right is clearly Weyl group invariant. 
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11.1.3 Using Verma modules to prove that ^ H : Z(g) — > 
U(h) w . 

Any ^ G h* can be thought of as a linear map of h into the commutative 
algebra, C and hence extends to an algebra homomorphism of U(h) to C. If 
we regard an element of /7(h) = S(h) as a polynomial function on h*, then this 
homomorphism is just evaluation at p. 
From our definitions, 

{\-p) 1 {z) = lH {z){\) VzGZ(g). 

Let us consider the Verma module Verm (A — p) where we denote its highest 
weight vector by v + . For any z G Z(g), we have hzv + = zhv + = (A — p)(h)zv + 
and e,izv + = ze,iv + = 0. So zv + is a highest weight vector with weight A — p 
and hence must be some multiple of v + . Call this multiple f\(z). Then 

so z acts as scalar multiplication by <f\(z) on all of Vcrm(A — p). To see what 
this scalar is, observe that since z — "f(z) G [/(g)n + , we see that z has the same 
action on v + as does 7(2) which is multiplication by (A — p){^{z)) = A(7#(z)). 
In other words, 

ip x {z) = \{1h{z)) = 7 H (»(A). 

Notice that in this argument we only used the fact that Verm(A— p) is a cyclic 
highest weight module: If V is any cyclic highest weight module with highest 
weight p — p then z acts as multiplication by f^(z) = p{^h{z)) = -fH(z)(p)- 
We will use this observation in a moment. 

Getting back to the case of Vcrm(A — p), for a simple root a = on let 
m = m, := (A, ai) and suppose that m is an integer. The element 

/>+ G Vcrm(A - p)„ 

where 

p, = A — p — ma = SjA — p. 

Now from the point of view of the sl(2) generated by ei.fi, the vector v + 
is a maximal weight vector with weight m — 1. Hence eif" l v + = 0. Since 
[ej,fi\ — 0, i 7^ j we have ej/™u + — as well. So f™v + ^ is a maximal 
weight vector with weight SiA — p. Call the highest weight module it generates, 
M. Then from M we see that 

<P Si \(z) = f\(z). 

Hence we have proved that 



( 7H (z))(w\) - (jh(z))(\) VweW 
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if A is dominant integral. But two polynomials which agree on all dominant 
integral weights must agree everywhere. We have shown that the image of 7 
lies in S(h) w . 

Furthermore, we have 

ziz 2 -7(21)7(22) = zi(z 2 -7(22)) +l(z 2 )(z 1 -7(22)) € U(g)n + . 

So 

This says that 7 is an algebra homomorphism, and since = (707 where a is 
an automorphism, we conclude that is a homomorphism of algebras. 

Equally well, we can argue directly from the fact that z G Z(g) acts as 
multiplication by (f\(z) = jh(z)(X) on Vcrm(A — p) that 7# is an algebra 
homomorphism. 

11.1.4 Outline of proof of bijectivity. 

To complete the proof of Harish-Chandra's theorem we must prove that j H 
is a bijection. For this we will introduce some intermediate spaces and ho- 
momorphisms. Let Y(g) := 5(g) g denote the subspace fixed by the adjoint 
representation (extended to the symmetric algebra by derivations). This is a 
subalgebra, and the filtration on 5(g) induces a filtration on Y(g). We shall 
produce an isomorphism 

/ : F(g) S(h) w . 

We also have a linear space isomorphism of U(g) — > 5(g) given by the symmetric 
embedding of elements of 5 ffc (g) into U(g), and let s be the restriction of this 
to Z(g). We shall see that s : Z(g) — > F(g) is an isomorphism. Finally, define 

^(g) = ^ (g)©^(g)©---©^ fe (g) 

so as to get a filtration on S*(g). This induces a filtration on 5(h) C 5(g). We 
shall show that for any z e £4(g) n Z(g) we have 

(f°s)(z) = jh(z) mod 5 fe _i(g). 

This proves inductively that jh is an isomorphism since s and / are. Also, since 
a does not change the highest order component of an clement in 5(h), it will 
be enough to prove that for z G £4(g) n Z(g) we have 

(/o S )(z)= 7 (z) mod Sn(g). (11.2) 

We now proceed to the details. 
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11.1.5 Restriction from S(g*) s to S(h*) w . 

We first discuss polynomials on g — that is elements of S(g*). Let r be a finite 
dimensional representation of of g, and consider the symmetric function F of 
degree k on g given by 

(x 1 ,...,x k )^J2 tr ( T ( x -i) ' ' ' T i x ^)) 

where the sum is over all permutations. For any Y € g, by definition, 

YF(X 1 ,...,X k )=F([Y,X 1 ],X 2 ,...,X k ) + ... + F(X 1 ,...,X k _ 1 ,[Y,X k ]). 
Applied to the above 

F(X u ...,X k )=trT(X 1 )---r(X k ) 

we get 

tr t{Y)t{X x ) ■ ■ ■ r(X k ) - tr ^X^Y) ■ ■ ■ r(X k ) 

+ tlT(X 1 )T(X)T(X 2 ---T(X k )---- 

= tr t{Y)t{Xi) ■ ■ ■ r{X k ) - tr T {X X ) ■ ■ ■ r{X k )r{Y) = 0. 
In other words, the function 

X i ► tr r(X) n 

belongs to S(g*) s . Now since h is a subspace of g, the restriction map induces 
a homomorphism, 

r:S(g*)^S(h*). 

If F G 5(g*) g , then, as a function on g it is invariant under the automorphism 
Tj := (exp ad ei)(exp ad — /j)(exp ad ei) and hence 

r : S(g*) e -> S(h*) w . 

If F G ^(g*) 8 is such that its restriction to h vanishes, then its value at any 
element which is conjugate to an element of h (under £(g) the subgroup of 
automorphisms of g generated by the n) must also vanish. But these include 
a dense set in g, so F, being continuous, must vanish everywhere. So the 
restriction of r to 5(g*) s is injcctive. 

To prove that it is surjective, it is enough to prove that S(h*) w is spanned 
by all functions of the form X trr(X) fc as t ranges over all finite dimensional 
representations and k ranges over all non-negative integers. Now the powers of 
any set of spanning elements of h* span 5(h*). So we can write any element 
of S(h*) w as a linear combination of the AX k where A denotes averaging over 
W. So it is enough to show that for any dominant weight A, we can express 
A fe in terms of trr fc . Let E\ denote the (finite) set of all dominant weights 
-< A. Let r denote the finite dimensional representation with highest weight A. 
Then trr(X) k — AX k (X) is a combination of A/J,(X) k where /i e E\. Hence 
by induction on the finite set E\ we get the desired result. In short, we have 
proved that 

r : 5(g*) s -» S(h*) w 

is bijective. 
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11.1.6 From S(g) s to S(h) w . 

Now we transfer all this information from 5(g*) to 5(g): Use the Killing form 
to identify g with g* and hence get an isomorphism 

a : 5(g) - 5(g*). 

Similarly, let 

: 5(h) -» 5(h*) 

be the isomorphism induced by the restriction of the Killing form to h, which 
we know to be non-degenerate. Notice that commutes with the action of the 
Weyl group. We can write 

5(g) = 5(h) + J 
where J is the ideal in 5(g) generated by n + and n_. Let 

3 : 5(g) -> 5(h) 

denote the homomorphism obtained by quotienting out by this ideal. We claim 
that the diagram 

5(g) — ^ 5(g*) 

j { i r 
5(h)' — ^ 5(h*) 

commutes. Indeed, since all maps are algebra homomorphisms, it is enough to 
check this on generators, that is on elements of g. If X e g, then 

(h,ra(X)) = (h,a(X)) = (h,X) 

where the scalar product on the right is the Killing form. But since h is orthog- 
onal under the Killing form to n + + n_ , we have 

(h,X) = (h,jX) = (h,0(jX)). QED 

Upon restriction to the g and lU-invariants, we have proved that the right hand 
column is a bijection, and hence so is the left hand column, since is a W- 
module morphism. Recalling that we have defined Y(g) := 5(g) s , we have 
shown that the restriction of j to Y(g) is an isomorphism, call it /: 

/ : F(g) 5(h) w . 

11.1.7 Completion of the proof. 

Now we have a canonical linear bijection of 5(g) with U(g) which maps 
%)3Xr--X r ^i ]T X wl ---X nr , 
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where the multiplication on the left is in 5(g) and the multiplication on the 
right is in U(g) and where S r denotes the permutation group on r letters. This 
map is a g module morphism. In particular this map induces a bijection 

*:Z(g)->r(g). 

Our proof will be complete once we prove (11.2). This is a calculation: write 

uajb := f A h J e B 

for our usual monomial basis, where the multiplication on the right is in the 
universal enveloping algebra. Let us also write 

Pajb := f A h J e B = ft ■■■h{ l ---hf--- e k J g 5(g) 

where now the powers and multiplication are in 5(g). The image of uajb under 
the canonical isomorphism with of U(g) with 5(g) will not be pajb in general, 
but will differ from pajb by a term of lower filtration degree. Now the projection 
7 : £7(g) — > U(h) coming from the decomposition 

C/(g) = U(h) (n_[/(g) + U( S )n+) 

sends uajb > unless A = = B and is the identity on u jo- Similarly, 

3 {pajb) = unless A = = B 

and 

j(pojo) =Pojo = h J . 
These two facts complete the proof of (11 .2). QED 

11.2 Chevalley's theorem. 

Harish Chandra's theorem says that the center of the universal enveloping alge- 
bra of a semi-simple Lie group is isomorphic to the ring of Weyl group invariants 
in the polyinomial algebra 5(h). Chevalley's theorem asserts that this ring is 
in fact a polynomial ring in £ generators where £ = dimh. To prove Cheval- 
ley's theorem we need to call on some facts from field theory and from the 
representation theory of finite groups. 

11.2.1 Transcendence degrees. 

A field extension L : K is finitely generated if there are elements a\, . . . , a n of L 
so that L = K(ot\, . . . , a n ). In other words, every element of L can be written 
as a rational expression in the a\,...,a n . 

Elements t\,...,tk of L are called (algebraically) independent (over K) if 
there is no non-trivial polynomial p with coefficients in K such that 



p(h, ...,t k )=0. 
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Lemma 15 If L : K is finitely generated, then there exists an intermediate field 
M such that M = K(a±, . . . , a r ) where the a\, . . . , a r are independent transcen- 
dental elements and L : M is a finite extension (i.e. L has finite dimension over 
M as a vector space). 

Proof. We are assuming that L = K{(3\, . . . ,(3 q ). If all the Pi are algebraic, 
then L : K is a finite extension. Otherwise one of the j3i is transcendental. 
Call this a.\. If L : K(oti) is a finite extension we are done. Otherwise one 
of the remaining f3i is transcendental over K(a\). Call it a 2 . So ai 7 a 2 are 
independent. Proceed. 

Lemma 16 // there is another collection 71 . . . 7 S so that L : AT (71, . . . , 7 S ) is 
finite then r = s. This common number is called the transcendence degree of L 
over AT. 

Proof. If ,s = 0, then every clement of L is algebraic, contradicting the assump- 
tion that the ai, . . . , a r are independent, unless r = 0. So we may assume that 
s > 0. Since L : M is finite, there is a polynomial p such that 

p(7i, an,..., a r ) = 0. 

This polynomial must contain at least one a, since 71 is transcendental. Renum- 
ber if necessary so that ct\ occurs in p. Then ct\ is algebraic over AT (71, 012, ■ • ■ , a r ) 
and L : K(ji,a 2} ■ ■ ■ , a r ) is finite. Continuing this way we can successively re- 
place as by 7s until we conclude that L : if (71, . . . , 7 r ) is finite. If s > r then 
the 7s are not algebraically independent, so s < r and similarly r < s. 

Notice that if at\, . . . , a n are algebraically independent then K(a\, . . . , a n ) 
is isomorphic to the field of rational functions in n indeterminates K(t\, . . . , t n ) 
since K(a\, . . . , a n ) = K(ct\, . . . , a n ^i)(a n ) by clearing denominators. 

11.2.2 Symmetric polynomials. 

The symmetric group S n acts on K(t\, . . . , t n ) by permuting the variables. The 
fixed field, F, contains all the symmetric polynomials, in particular the ele- 
mentary symmetric polynomials si,...,s n where s r is the sum of all possible 
distinct products taken r at a time. Using the general theory of field extensions, 
we can conclude that 

Proposition 32 F = K(si, . . . , s„). 

The strategy of the proof is to first show that the dimension of the extension 
K(t\, . . . , t n ) : K(s\, . . . , s n ) is < n\ and then a basic theorem in Galois theory 
(which we shall recall) says that the dimension of K : F equals the order of 
the group G — S n which is n\. Since K(si, . . . , s„) C F this will imply the 
proposition. Consider the extensions 



K{tx,.. . , t n ) D K(s!,.. .,s n ,t n ) D K(si,.. . ,s n ). 
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We have the equation f(t n ) = where 

f(t) := t n - s^- 1 + t n - 2 s 2 ■■■ + (-l) n s n . 

This shows that the dimension of the field extension K(s\, . . . , s n , t n ) : K(s\, . . . , s n ) 
is < n. If we let s[ . . . s' n _ 1 denote the elementary symmetric functions in n — 1 
variables, we have 

Sj t n Sj_-[ -\- Sj 

so 

K(sx, . . .,s n ,t n ) = K(t n ,s\, . . • ,s^_!). 
By induction we may assume that 

dimK(tu . . . ,t n ) : K(s 1 , . . . , s n ,t n ) = dimK(t n )(t 1} . . . , i„_i) : K(t n )(s' l , . . . , s' n _ x ) 

< (n-1)! 

proving that 

dimK(ti, ...,t n ) : K(si, ...,s n )< n\. 
A fundamental theorem of Galois theory says 

Theorem 20 Let G be a finite subgroup of the group of automorphisms of the 
field L over the field K, and let F be the fixed field. Then 

dim[L : F] = #G. 

This theorem, whose proof we will recall in the next section, then completes 
the proof of the proposition. The proposition implies that every symmetric 
polynomial is a rational function of the elementary symmetric functions. 

In fact, every symmetric polynomial is a polynomial in the elementary sym- 
metric functions, giving a stronger result. This is proved as follows: put the 
lexicographic order on the set of n— tuples of integers, and therefor on the set 
of monomials; so x 1 ^ ■ ■ ■ x l ™ is greater than xj 1 • • • xfc 1 in this ordering if i\ > j\ 
of i\ — ji and ii > ji or etc. Any polynomial has a "leading monomial" the 
greatest monomial relative to this lexicographic order. The leading monomial 
of the product of polynomials is the product of their leading monomials. We 
shall prove our contention by induction on the order of the leading monomial. 
Notice that if p is a symmetric polynomial, then the exponents i\, . . . , i n of its 
leading term must satisfy 

k > «2 > • • • > i n , 

for otherwise the monomial obtained by switching two adjacent exponents (which 
occurs with the same coefficient in the symmetric polynomial, p) would be 
strictly higher in our lexicographic order. Suppose that the coefficient of this 
leading monomial is a. Then 

q — asj s 2 s n _ 1 s n 

has the same leading monomial with the same coefficient. Hence p — q has a 
smaller leading monomial. QED 
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11.2.3 Fixed fields. 

We now turn to the proof of the theorem of the previous section. 

Lemma 17 Every distinct set of monomorphisms of a field K into a field L 
are linearly independent over L. 

Let Ai,...,A„ be distinct monomorphisms of K — > L. The assertion is that 
there can not exist ai, . . . , a n £ L such that 

aiAi(x) + • • • + a n \ n (x) = Vx G K 

unless all the a,i = 0. Assume the contrary, so that such an equation holds, and 
we may assume that none of the ai = 0. Looking at all such possible equations, 
we may pick one which involves the fewest number of terms, and we may assume 
that this is the equation we are studying. In other words, no such equation holds 
with fewer terms. Since Ai ^ A„, there exists a y G K such that X\{y) ^ X n (y) 
and in particular y ^ 0. Substituting yx for x gives 

a 1 X 1 (yx) H h a n X n (yx) = 

so 

aiAi(y)Ai(a;) H h a n X n (y)X n (x) = 

and multiplying our original equation by Ai(y) and subtracting gives 

d2{Xi{y) - X 2 (y))X 2 (x) H h a n (X n (y) - Ai(y))A„(x) = 

which is a non-trivial equation with fewer terms. Contradiction. 

Let n = 4^G, and let the elements of G be g\ = 1, . . . , g n . Suppose that dim 
L : F = m < n. Let Xi, . . . , x m be a basis of L over F. The system of equations 

9i{xj)yi H \-9n{xj)y„ = 0, j = 1, . . . ,m 

has more unknowns than equations, and so we can find non-zero yi,...,y n 
solving these equations. Any b <G L can be expanded as 

b = b 1 x l H h b m x m , bi G F, 

and so 

9i(b)yi-\ \- 9n(b)y n = ^2 / b j [g 1 {xj)y 1 -\ h g n (xj)y n ] 

j 

= 

showing that the monomorphisms <?i are linearly dependent. This contradicts 
the lemma. 

Suppose that dim L : F > n. Let x\, . . . ,x ni x n +i be linearly independent 
over F, and find yi, . . . , y n +\ G L not all zero solving the n equations 

9j{xi)yi H h gj{x n+1 )y n+1 = 0, j = l,...,n. 
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Choose a solution with fewest possible non-zero y's and relabel so that the first 
are the non- vanishing ones, so the equations now read 

9j(xi)yi H h 9j{x r )y r = 0, .7 = 1, ... ,n, 

and no such equations hold with less than r y's. Applying g e G to the preceding 
equation gives 

99j{xi)g{yi) + ■■■ + ggj{x r )g(y r ) = 0. 

But ggj runs over all the elements of G, and so g(yi), ■ ■ ■ ,g(y r ) is a solution of 
our original equations. In other words we have 

9j(xi)yi-\ \-gj{x r )y r = and 

9j(xi)g(yi) + --- + 9j(x r )g(y r ) = 0. 

Multiplying the first equations by g(yi), the second by y\ and subtracting, gives 

g 3 (x 2 )[y 2 g(yi) -5(3/2)2/1)] + ■ ■ ■ + gj{x r )[y r g(yi) - g{y r )yi] = 0, 

a system with fewer y's. This can not happen unless the coefficients vanish, i.e. 

Vi9{yi) = yig{yi) 

or 

ViVi 1 = giviVi 1 ) Vg e G. 

This means that 

Setting Zi = j/j/j/i and k = yi, we get the equation 

Xikzi + • • • + x r kz r = 

as the first of our system of equations. Dividing by k gives a linear relation 
among x\, . . . , x r contradicting the assumption that they are independent. 

11.2.4 Invariants of finite groups. 

Let G be a finite group acting on a vector space. It action on the symmetric 
algebra S(V*) which is the same as the algebra of polynomial functions on V 

by 

(gf)(v) = fig-'v). 

Let 

R=S(V*) G 

be the ring of invariants. Let S = S(V*) and L be the field of quotients of S, 
so that L — K(ti,...,t n ) where n = dimV. From the theorem on fixed fields, 
we know that the dimension of L as an extension of L G is equal to the number 
of elements in G, in particular finite. So L G has transcendence degree n over 
the ground field. 
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Clearly the field of fractions of R is contained in L G . We claim that they 
coincide. Indeed, suppose that p,q € S, p/q € L G . Multiply the numerator 
and denominator by Ilgp the product taken over all g € G, g ^ 1. The new nu- 
merator is G invariant. Therefore so is the denominator, and we have expressed 
p/q as the quotient of two elements of R. 

If the finite group G acts on a vector space, then averaging over the group, 
i.e. the map 

E3f~f* : ,',;»••/ 

is a projection onto the subspace of invariant elements: 

A: f^ f E — > E G . 

In particular, if E is finite dimensional, 

dimi; G = trA (11.3) 

We may apply the averaging operator to our (infinite dimensional) situation 
where S — S(V*) and R = S G in which case we have the additional obvious 
fact that 

(pq) S =ptq VpeS, qeR. 

Let R + C R denote the subring of R consisting of elements with constant term 
zero. Let 

/ := SR+ 

so that / is an ideal in 5*. By the Hilbert basis theorem (whose proof we recall 
in the next section) the ideal / is finitely generated, and hence, from any set of 
generators, we may choose a finite set of generators. 

Theorem 21 Let fi,...,f r be homogeneous elements of R + which generate I 
as an ideal of S. Then /i, . . . , f r together with 1 generate R as a K algebra. In 
particular, R is a finitely generated K algebra. 

Proof. We must show that any f £ R can be expressed as a polynomial in 
the /i, . . . , f r , and since every / is a sum of its homogeneous components, it is 
enough to do this for homogeneous / and we proceed by induction on its degree. 
The statement is obvious for degree zero. For positive degree, / e R C / so 

/ = Sl/l H h S r f r , Si € / 

and since f , f\, . . . , f r are homogeneous, we may assume the Si are homogeneous 
of degree deg / — deg /, since all other contributions must cancel. Now apply A 
to get 

/ = Ah + ■■■ + 4fr- 

The lie in R and have lower homogeneous degree than /, and hence can be 
expressed as polynomials in fi, . . . ,f r . Hence so can /. 
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11.2.5 The Hilbert basis theorem. 

A commutative ring is called Noetherian if any of the following equivalent con- 
ditions holds: 

1. If Ii C I2 C • • • is an ascending chain of ideals then there is a k such that 

Ik = Ik+1 = Ik+2 = 

2. Every non-empty set of ideals has a maximal element with respect to 
inclusion. 

3. Every ideal is finitely generated. 

The Hilbert basis theorem asserts that if R is a Noetherian ring, then so is 
the polynomial ring R[X\. In particular, all ideals in K[Xi, . . . , X n ] are finitely 
generated. 

Let / be an ideal in RX and for any positive integer k let L k (I) C R be 
defined by 

k 

L k (I) := {a k e _R|3a fc _!, . . . , a\ £ R with <= I}- 



For each k, L k (I) is an ideal in R. Multiplying by X shows that 

L k (I) C L k+1 (I). 

Hence these ideals stabilize. If 7 C J and L k (I) — L k (J) for all k, we claim 
that this implies that I = J. Indeed, suppose not, and choose a polynomial of 
smallest degree belonging to J but not to I, say this degree is k. Its leading 
coefficient belongs to L k (J) and can not belong to L k (I) because otherwise we 
could find a polynomial of smaller degree belonging to J and not to I. 
Proof of the Hilbert basis theorem. Let 

J C h C • • • 

be an ascending chain of ideals in R[X]. Consider the set of ideals L p (I q ). We 
can choose a maximal member. So for k > p we have 

L k (Ij) - L k (I q ) Vj>q. 

For each of the finitely many values j = 1 , . . . , p — 1 , the ascending chains 

ii(/o)Cii(/i)C- 

stabilizes. So we can find a large enough r (bigger than the finitely many large 
values needed to stabilize the various chains) so that 

Li(Ij) = Li(I r ) Vj >r, Mi. 

This shows that Ij = Ir Vj > r. 
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11.2.6 Proof of Chevalley's theorem. 

This says that if K = R and W is a finite subgroup of 0(V) generated by 
reflections, then its ring of invariants is a polynomial ring in n- generators, 
where n = dimV. Without loss of generality we may assume that W acts 
effectively, i.e. no non-zero vector is fixed by all of W . 

Let fi,...,f r be a minimal set of homogeneous generators. Suppose we 
could prove that they are algebraically independent. Since the transcendence 
degree of the quotient field of R is n — dim V, we conclude that r — n. So 
the whole point is to prove that a minimal set of homogeneous generators must 
be algebraically independent - that there can not exist a non-zero polynomial 
h = h(yi, . . . , y r ) so that 

fc(/i,...,/ r )=0. (11.4) 

So we want to get a smaller set of generators assuming that such a relation 
exists. Let 

di ■= deg/i, ... ,d r := deg/ r . 



For any non-zero monomial 
occurring in h the term 



ayt 1 ■ ■ ■ vl r 



we get by substituting /'s for y's has degree 

d = eidi + • • • e r d r 

and hence we may throw away all monomials in h which do not satisfy this 
equation. Now set 

hi ■= q — (A, • • • , f r ) 

so that hi G R is homogeneous of degree d — di, and let J be the ideal in 
R generated by the hi. Renumber fi,. ■ ■ ,f r so that hi, . . . , h m is a minimal 
generating set for J. This means that 

m 

hi = y]gjjhj, € R 

j=i 

for i > m (if m < r; if m = r we have no such equations). Once again, since the 
hi are homogeneous of degree d—di we may assume that each is homogeneous 
of degree di — dj by throwing away extraneous terms. 

Now let us differentiate the equation (11.4) with respect to x k , k = 1, . . . ,n 
to obtain 



E 



k = i, 
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and substitute the above expressions for hi, i > m to get 

k = 1, . . . , n. 




Set 

dfi . \ dfj . 

^ : =^+ L 9ii W . = l,...,m 

so that each is homogeneous with 

degK = di - 1 

and we have the equation 

hipx-\ h m p m = 0. (11.5) 

We will prove that this implies that 

Pie/. (11.6) 

Assuming this for the moment, this means that 

0^+22 9ji^- = sum i=1 f iqi 

j—'m+l 

where qt € S. Multiply these equations by Xk and sum over k and apply Euler's 
formula for homogeneous polynomials 



"dx k 

We get 



difi + d i9jifj = fi r i 



with degri > if it is not zero. Once again, the left hand side is homogeneous 
of degree d\ so we can throw away all terms on the right which are not of this 
degree because of cancellation. But this means that we throw away the term 
involving /i, and we have expressed /i in terms of /2, . . . , f r , contradicting our 
choice of /i, . . . , / r as a minimal generating set. 

So the proof of Chevalley's theorem reduced to proving that (11.5) implies 
(11.6), and for this we must use the fact that W is generated by reflections, 
which we have not yet used. The desired implication is a consequence of the 
following 

Proposition 33 Let h\,... ,h m G R be homogeneous with hi not in the ideal 
of R generated by h 2 , ■ ■ ■ , h m . Suppose that (11.5) holds with homogeneous ele- 
ments pi G S. Then (11.6) holds. 
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Notice that hi can not lie in the ideal of 5* generated h 2 , . . . h m because we can 
apply the averaging operator to the equation 

hi = k 2 h 2 + h k m h m ki e S 

to arrange that the same equation holds with fcj replaced by k\ £ R. 

We prove the proposition by induction on the degree of pi . This must be 
positive, since pi ^ constant would put hi in the ideal generated by the 
remaining hi. Let s be a reflection in W and H its hyperplanc of fixed vectors. 
Then 

spi — Pi = on H. 

Let £ be a non-zero linear function whose zero set is this hyperplane. With 
no loss of generality we may assume that the last variable, x n , occurs with 
non-zero coefficient in I relative to some choice of orthogonal coordinates. In 
fact, by rotation, we can arrange (temporarily) that £ = x n . Expanding out the 
polynomial spi — pi in powers of the (rotated) variables, we see that sgi — gi 
must have no terms which are powers of Xi, . . . , x„_i alone. Put invariantly, we 
see that 

spi -Pi = £r t 

where r, is homogeneous of degree one less that that of p,. Apply s to equation 
(11.5) and subtract to get 

£{hin H h rn r m ) = 0. 

Since £ ^ we may divide by I to get an equation of the form (11.5) with pi 
replaced by r\ of lower degree. So n € I by induction. So 

spi - Pi £ I. 

Now W stabilizes R + and hence / and we have shown that each w £ W acts 
trivially on the quotient of pi in this quotient space S/I. Thus p\ = Api = pi 
mod /. So pi £ I since p\ £ I. QED 



